Missing or stray quote in line 1 (CSV::MalformedCSVError)

gar*_*410 5 ruby csv ruby-on-rails

I'm having trouble importing this CSV file in Ruby/Rails.

The error message I get is:

Missing or stray quote in line 1 (CSV::MalformedCSVError)

But I'm not sure what is going on, because my CSV looks perfectly fine. Here is some sample data:

"lesley_grades","lesley_id","last","first","active","site","cohort","section","sections_title","faculty","completed_term_cred","term","sec_start_date","sec_end_date","grade","stc_cred","active_program","most_recent_program","intent_filed","stc_term_gpa","sta_cum_gpa","start_term","prog_status","last_change_date"
,1234456,John,Doe,TRUE,"Baltimore, MD",0002012,14/FA_ERLIT_6999_U15AA,Directed Independent Study,"Jane Hicks , Jill Saunders",2,14/FA,9/3/14,12/17/14,B-,2,EME.2270.TCBAL.01,EME.2270.TCBAL.01, ,3.3,3.148,12/SU,A,9/2/14
,1234455,John,Doe,TRUE,"Baltimore, MD",0002012,14/FA_ERSPD_6999_U15AG,Directed Independent Study,"Jane Hicks , Jill Saunders",3,14/FA,9/3/14,12/17/14,A-,3,EME.2270.TCBAL.01,EME.2270.TCBAL.01, ,3.3,3.148,12/SU,A,9/2/14

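For context, a valid CSV looks like this, with lesley_grades as the first column. Assuming all the migrations have been set up beforehand, the CSV import script looks at the first column, checks whether it names an Active Record class, and then stores the rows in the database under the model with exactly that name.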

lesley_grades   lesley_id   last   first    active  
                 1234556    Doe    John     TRUE    
                 1123445    Doe    John     TRUE

This is the part of the code that is causing the problem:

def import!(csv)
  csv_reader = CSV.parse(csv)
  ActiveRecord::Base.transaction do
    csv_reader.each do |row|
      set_record_class_and_columns(row) if header_row?(row)

      if columns_mapping_defined? && record_class_defined? && record_row?(row)
        import_row(row)
      end
    end
    if imports_failed?
      puts 'Aborting importing and rolling back...'
      show_errors
      raise ActiveRecord::Rollback
    end
  end
end

It never gets past this line: csv_reader = CSV.parse(csv)

Before I put quotes around the headers, I was getting this error instead:

Unquoted fields do not allow \r or \n (line 1). (CSV::MalformedCSVError)

UPDATE

The CSV import is started from the command line like this:

rails runner scripts/import_csv.rb < lesley_grades.csv

and is then initialized here:

CSVImporter.new.import!($stdin)

As @smathy suggested, I changed the call to CSV.parse(csv.gsub /\r/, '')

But now the import! method with the gsub call raises this error:

in `import!': undefined method `gsub' for #<IO:<STDIN>> (NoMethodError)

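I'm not sure how to turn the CSV input into an object that gsub works on.

Any suggestions or refactoring to make this work? Thanks, everyone.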

sma*_*thy 13

Your CSV data came from Windows and has CRLF (i.e. "\r\n") line endings instead of "\n". You need to strip the "\r" characters before trying to parse it:

CSV.parse(csv.gsub /\r/, '')
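As a minimal sketch of what that one-liner does, assuming the data is already in a String: the sample text below is made up for illustration (it is not the OP's file), and it uses CRLF line endings to imitate a Windows export.

```ruby
require "csv"

# Hypothetical sample standing in for a Windows-exported CSV (CRLF endings).
raw = "\"id\",\"site\"\r\n1,\"Baltimore, MD\"\r\n"

# Remove every carriage return so only "\n" line endings remain.
cleaned = raw.gsub(/\r/, "")

# Quoted fields containing commas still parse as a single column.
rows = CSV.parse(cleaned)
p rows
```

Once the carriage returns are gone, each line parses into its own row and the quoted "Baltimore, MD" field stays intact.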

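UPDATE

Given the additional information from the OP: csv is an IO object ($stdin), not a String, so read it into a String before calling gsub: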

CSV.parse(csv.read.gsub /\r/, '')
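If the importer might be handed either an IO or a String, one option is a small wrapper along these lines. This is only a sketch: parse_without_carriage_returns is a hypothetical helper name, not something from the OP's code, and StringIO stands in for $stdin so the example can run on its own.

```ruby
require "csv"
require "stringio"

# Hypothetical helper: accepts an IO (such as $stdin) or a String.
def parse_without_carriage_returns(source)
  # IO-like objects respond to #read; Strings are used as-is.
  text = source.respond_to?(:read) ? source.read : source.to_s
  CSV.parse(text.gsub(/\r/, ""))
end

# StringIO plays the role of $stdin; the data is made up for illustration.
fake_stdin = StringIO.new("lesley_id,last,first\r\n1234456,Doe,John\r\n")
rows = parse_without_carriage_returns(fake_stdin)
p rows
```

Inside import!, the first line could then become csv_reader = parse_without_carriage_returns(csv), assuming the rest of the method stays as it is.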