python——從csv文件中隨機提取某幾行添加到另一個csv文件中(含代碼)

舉個例子,從a.csv裏隨機提取10%的數據到b.csv,且兩個文件的列名(表頭)相同,兩個文件的cloumns一樣,解決中文亂碼

直接上代碼吧不廢話了:

import pandas as pd
data = pd.read_csv('a.csv')

# df.sample(n=None, frac=None, replace=False, weights=None, random_state=None, axis=None)
# n = number of rows(optional, cannot be used with frac) 抽取的行數;
# frac = fraction/proportion(optional, cannot be used with n) 抽取的比例;
# replace = Allow or disallow sampling of the same row more than once (boolean, default False) 是否爲有放回抽樣;
# weights (str or ndarray-like, optional) 權重
# random_state (int to use as interval, or call np.random.get_state(), optional) 整數作爲間隔,或者調用np.random.get_state()
# axis = extract row or column (0->row, 1->column) 抽取行還是列(0是行,1是列)

# random select 10% from dataset
sample = data.sample(frac=0.1, random_state=5, axis=0)
# export to csv file
sample.to_csv('b.csv',encoding='utf_8_sig')

 

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章