日韩性视频-久久久蜜桃-www中文字幕-在线中文字幕av-亚洲欧美一区二区三区四区-撸久久-香蕉视频一区-久久无码精品丰满人妻-国产高潮av-激情福利社-日韩av网址大全-国产精品久久999-日本五十路在线-性欧美在线-久久99精品波多结衣一区-男女午夜免费视频-黑人极品ⅴideos精品欧美棵-人人妻人人澡人人爽精品欧美一区-日韩一区在线看-欧美a级在线免费观看

歡迎訪問 生活随笔!

生活随笔

當前位置: 首頁 > 编程资源 > 编程问答 >内容正文

编程问答

问题: return unicode(text, encoding, errors=errors) UnicodeDecodeError: ‘utf-8‘ codec can‘t decode

發布時間:2024/3/12 编程问答 44 豆豆
生活随笔 收集整理的這篇文章主要介紹了 问题: return unicode(text, encoding, errors=errors) UnicodeDecodeError: ‘utf-8‘ codec can‘t decode 小編覺得挺不錯的,現在分享給大家,幫大家做個參考.

報錯全文:Traceback (most recent call last):
File “D:/xiangmu/python/test/提取詞向量.py”, line 13, in
trainWordvec(“評論分詞提取(云南).txt”,500)
File “D:/xiangmu/python/test/提取詞向量.py”, line 8, in trainWordvec
model=word2vec.Word2Vec(sentences,size=sizes)
File “D:\xiangmu\python\test\venv\lib\site-packages\gensim\models\word2vec.py”, line 597, in init
super(Word2Vec, self).init(
File “D:\xiangmu\python\test\venv\lib\site-packages\gensim\models\base_any2vec.py”, line 745, in init
self.build_vocab(sentences=sentences, corpus_file=corpus_file, trim_rule=trim_rule)
File “D:\xiangmu\python\test\venv\lib\site-packages\gensim\models\base_any2vec.py”, line 921, in build_vocab
total_words, corpus_count = self.vocabulary.scan_vocab(
File “D:\xiangmu\python\test\venv\lib\site-packages\gensim\models\word2vec.py”, line 1403, in scan_vocab
total_words, corpus_count = self._scan_vocab(sentences, progress_per, trim_rule)
File “D:\xiangmu\python\test\venv\lib\site-packages\gensim\models\word2vec.py”, line 1372, in _scan_vocab
for sentence_no, sentence in enumerate(sentences):
File “D:\xiangmu\python\test\venv\lib\site-packages\gensim\models\word2vec.py”, line 1201, in iter
words, rest = (utils.to_unicode(text[:last_token]).split(),
File “D:\xiangmu\python\test\venv\lib\site-packages\gensim\utils.py”, line 368, in any2unicode
return unicode(text, encoding, errors=errors)
UnicodeDecodeError: ‘utf-8’ codec can’t decode byte 0xbb in position 7: invalid start byte
解決
把txt文本修改成帶BOM的UTF-8的

總結

以上是生活随笔為你收集整理的问题: return unicode(text, encoding, errors=errors) UnicodeDecodeError: ‘utf-8‘ codec can‘t decode的全部內容,希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網站內容還不錯,歡迎將生活随笔推薦給好友。