Parsing an Excel xlsx file using Koogra

4 c# excel .net-4.0 text-parsing xml-parsing

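I tried several packages from GitHub to parse/process this fairly large Excel document. Every approach I tried threw an out-of-memory exception.

After some more googling I found this GNU library named Koogra, which seems to be the only one I could find that is suited to the job. I couldn't afford to keep searching much longer, because I'm running short on time for this part of the project.

The code I have now gets past the "out of memory" part,

so the only thing left is how to parse the Excel document properly, so that I can extract, say, a dictionary collection where the key is one column and the value is another column.

This is the file in question

This is my code so far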

var path = Path.Combine(Environment.CurrentDirectory, "tst.xlsx");
Net.SourceForge.Koogra.Excel2007.Workbook xcel = new Net.SourceForge.Koogra.Excel2007.Workbook(path);
var ss = xcel.GetWorksheets();

小智 5

Found it after some more... googling... The first line is the usage for the 2007 (xlsx) version,

and the second line is for the xls version

        // xlsx (Excel 2007) reader
        Net.SourceForge.Koogra.IWorkbook genericWB = Net.SourceForge.Koogra.WorkbookFactory.GetExcel2007Reader("tst.xlsx");

        // xls (BIFF) reader
        //genericWB = Net.SourceForge.Koogra.WorkbookFactory.GetExcelBIFFReader("some.xls");

        Net.SourceForge.Koogra.IWorksheet genericWS = genericWB.Worksheets.GetWorksheetByIndex(0);

        for (uint r = genericWS.FirstRow; r <= genericWS.LastRow; ++r)
        {
            Net.SourceForge.Koogra.IRow row = genericWS.Rows.GetRow(r);

            // guard in case an empty row comes back as null
            if (row == null)
                continue;

            for (uint c = genericWS.FirstCol; c <= genericWS.LastCol; ++c)
            {
                // fetch the cell once; guard in case an empty cell comes back as null
                var cell = row.GetCell(c);
                if (cell == null)
                    continue;

                // raw value
                Console.WriteLine(cell.Value);

                // formatted value
                Console.WriteLine(cell.GetFormattedValue());
            }
        }

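I hope this helps anyone who runs into the same "out of memory" problem... enjoy.

A small update to the code above

OK.. I've played with it a bit. As far as the content of the file in question is concerned (a chart ranking based on Unique IP), the current code is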

            // Place the source file in your project directory\bin\debug;
            // the extracted file is written next to the source file.
            var pathtoRead = Path.Combine(Environment.CurrentDirectory, "tst.xlsx");
            var pathtoWrite = Path.Combine(Environment.CurrentDirectory, "tst.txt");

            Net.SourceForge.Koogra.IWorkbook genericWB = Net.SourceForge.Koogra.WorkbookFactory.GetExcel2007Reader(pathtoRead);
            Net.SourceForge.Koogra.IWorksheet genericWS = genericWB.Worksheets.GetWorksheetByIndex(0);
            StringBuilder SbXls = new StringBuilder();
            for (uint r = genericWS.FirstRow; r <= genericWS.LastRow; ++r)
            {
                Net.SourceForge.Koogra.IRow row = genericWS.Rows.GetRow(r);

                // guard in case an empty row comes back as null
                if (row == null)
                    continue;

                for (uint ColCount = genericWS.FirstCol; ColCount <= genericWS.LastCol; ++ColCount)
                {
                    // only the first two columns (0 and 1) are exported
                    if (ColCount > 1)
                        break;

                    // an empty cell (if it comes back as null) is written as an empty field
                    var cell = row.GetCell(ColCount);
                    var formated = cell == null ? string.Empty : cell.GetFormattedValue();

                    // column 0 is followed by a tab, column 1 ends the line
                    string LineEnding = ColCount == 0 ? "\t" : Environment.NewLine;
                    SbXls.Append(string.Concat(formated, LineEnding));
                }
            }
            File.WriteAllText(pathtoWrite, SbXls.ToString());
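The question also asked for a dictionary whose key is one column and whose value is another. Here is a minimal sketch of that step, assuming each row has already been read into a string array (for example by collecting row.GetCell(c).GetFormattedValue() in the loop above). ColumnDictionary and Build are hypothetical names, not part of Koogra, and the sample rows in Main are made up purely for illustration.

```csharp
using System;
using System.Collections.Generic;

static class ColumnDictionary
{
    // Hypothetical helper (not part of Koogra): builds a key/value map from
    // rows that were already read into string arrays.
    public static Dictionary<string, string> Build(
        IEnumerable<string[]> rows, int keyCol, int valueCol)
    {
        var result = new Dictionary<string, string>();
        foreach (var cells in rows)
        {
            // Skip rows that are missing or too short to hold both columns.
            if (cells == null || cells.Length <= Math.Max(keyCol, valueCol))
                continue;

            var key = cells[keyCol];
            if (string.IsNullOrEmpty(key))
                continue;

            // Indexer assignment: a repeated key keeps the last value seen.
            result[key] = cells[valueCol];
        }
        return result;
    }

    static void Main()
    {
        // Made-up sample rows standing in for cells read from the worksheet.
        var rows = new List<string[]>
        {
            new[] { "10.0.0.1", "3" },
            new[] { "10.0.0.2", "7" },
        };

        var map = Build(rows, 0, 1);
        foreach (var pair in map)
            Console.WriteLine(pair.Key + " -> " + pair.Value);
    }
}
```

Using the indexer instead of Add means a duplicate key overwrites the earlier value rather than throwing; if duplicates in the key column should be treated as an error, Add would be the stricter choice.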