I'm following the Python for Data Analysis book. It tells me to get the ALL file from http://www.fec.gov/disclosurep/PDownload.do and load it with pandas:
import pandas as pd
fec = pd.read_csv('P00000001-ALL.csv')
But the actual file has changed since the book was written. The old file (available at https://github.com/pydata/pydata-book/blob/master/ch09/P00000001-ALL.csv) loads just fine:
fec = pd.read_csv('../pydata-book/ch09/P00000001-ALL.csv')
But the new one loads incorrectly: the columns appear to be shifted by one, and the first column's value is dropped. Here is how one record looks after loading:
cmte_id P60008059
cand_id Bush, Jeb
cand_nm EASTON, AMY KELLY MRS.
contbr_nm KEY BISCAYNE
contbr_city FL
contbr_st 331491716
contbr_zip HOMEMAKER
contbr_employer HOMEMAKER
contbr_occupation 2700
contb_receipt_amt 26-JUN-15
contb_receipt_dt NaN
receipt_desc NaN
memo_cd NaN
memo_text SA17A
form_tp 1024106
file_num SA17.114991
tran_id P2016
election_tp NaN
The actual row in the file is:
C00579458,"P60008059","Bush, Jeb","EASTON, AMY KELLY MRS.","KEY BISCAYNE","FL","331491716","HOMEMAKER","HOMEMAKER",2700,26-JUN-15,"","","","SA17A","1024106","SA17.114991","P2016",
So the C00579458 value is lost somewhere.
The header looks like this:
cmte_id,cand_id,cand_nm,contbr_nm,contbr_city,contbr_st,contbr_zip,contbr_employer,contbr_occupation,contb_receipt_amt,contb_receipt_dt,receipt_desc,memo_cd,memo_text,form_tp,file_num,tran_id,election_tp
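Counting fields, the data row ends with a trailing comma, so it has 19 values against 18 header names. Below is a minimal sketch that reproduces the shift, assuming the trailing delimiter is what makes pandas treat the first column as the index. `sample` is a made-up two-line stand-in for the real file, built from the header and row quoted above. `index_col=False` is the `read_csv` option documented for files with delimiters at the end of each line; whether it is the right fix for the full FEC file is not verified here.

```python
import io

import pandas as pd

# Header and data row copied from the question; the row has a trailing comma.
header = (
    'cmte_id,cand_id,cand_nm,contbr_nm,contbr_city,contbr_st,contbr_zip,'
    'contbr_employer,contbr_occupation,contb_receipt_amt,contb_receipt_dt,'
    'receipt_desc,memo_cd,memo_text,form_tp,file_num,tran_id,election_tp'
)
row = (
    'C00579458,"P60008059","Bush, Jeb","EASTON, AMY KELLY MRS.",'
    '"KEY BISCAYNE","FL","331491716","HOMEMAKER","HOMEMAKER",2700,26-JUN-15,'
    '"","","","SA17A","1024106","SA17.114991","P2016",'
)
sample = header + '\n' + row + '\n'  # hypothetical stand-in for the real file

# Default behaviour: the row has one more field than the header,
# so pandas uses the first field as the index and the rest shift left.
shifted = pd.read_csv(io.StringIO(sample))
print(shifted.index[0])            # the "lost" committee id ends up here
print(shifted.iloc[0]['cmte_id'])  # holds the candidate id instead

# index_col=False tells pandas not to use the first column as the index;
# the empty trailing field is dropped and the columns line up with the header.
fixed = pd.read_csv(io.StringIO(sample), index_col=False)
print(fixed.iloc[0]['cmte_id'])
```

If this matches what happens with the real download, the same call with the file path in place of `io.StringIO(sample)` would be the thing to try.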