在合并两个csv表格时,表中有汉字,初始代码如下
data2013 = pd.read_csv(path1 + "-4.csv", encoding='utf-8', header=None, low_memory=False) data2014 = pd.read_csv(path2 + "-4.csv", encoding='utf-8', header=None, low_memory=False) mkdir('daily/lonely_2010-2019/' + path) data2013.to_csv('daily/lonely_2010-2019/' + path + '/' + path + ".csv", encoding="utf_8", header=False, index=False, mode='a') data2014.to_csv('daily/lonely_2010-2019/' + path + '/' + path + ".csv", encoding="utf_8", header=False, index=False, mode='a')产生了错列和汉字乱码的问题 这种问题是因为编码格式问题,在改为
data2013 = pd.read_csv(path1 + "-4.csv", encoding='utf-8', header=None, low_memory=False) data2014 = pd.read_csv(path2 + "-4.csv", encoding='utf-8', header=None, low_memory=False) mkdir('daily/lonely_2010-2019/' + path) data2013.to_csv('daily/lonely_2010-2019/' + path + '/' + path + ".csv", encoding="utf_8_sig", header=False, index=False, mode='a') data2014.to_csv('daily/lonely_2010-2019/' + path + '/' + path + ".csv", encoding="utf_8_sig", header=False, index=False, mode='a')可恢复正常。