相关疑难解决方法(0)

static public string Serialize()
{
     string returnValue;
     System.IO.FileInfo file1 = new FileInfo(@"c:\file.txt");
     returnValue = System.IO.File.ReadAllText(file1.ToString());
}

Run Code Online (Sandbox Code Playgroud)

sha*_*ram

2009 09-03

3
推荐指数

1
解决办法

2151
查看次数

在 C# 中有效地在大文件中搜索字符串

我正在构建一个通过比较哈希来扫描文件的应用程序。我需要搜索超过 1GB 的哈希值来获取文件的哈希值。我为此找到了其他解决方案，例如 Aho-Corasick，但它比File.ReadLines(file).Contains(str).

这是迄今为止最快的代码，使用File.ReadLines. 扫描一个文件大约需要 8 秒，而使用 Aho-Corasick 扫描一个文件大约需要 2 分钟。由于显而易见的原因，我无法将整个哈希文件读入内存。

IEnumerable<DirectoryInfo> directories = new DirectoryInfo(scanPath).EnumerateDirectories();
IEnumerable<FileInfo> files = new DirectoryInfo(scanPath).EnumerateFiles();

FileInfo hashes = new FileInfo(hashPath);
await Task.Run(() =>
{
    IEnumerable<string> lines = File.ReadLines(hashes.FullName);
    
    foreach (FileInfo file in files) {
        if (!AuthenticodeTools.IsTrusted(file.FullName))
        {
            string hash = getHash(file.FullName);
            if (lines.Contains(hash)) flaggedFiles.Add(file.FullName);
        }
        filesScanned += 1;
    }
});
foreach (DirectoryInfo directory in directories)
{
    await scan(directory.FullName, hashPath);
    directoriesScanned += 1;
}

Run Code Online (Sandbox Code Playgroud)

编辑：根据请求，以下是文件内容的示例：

5c269c9ec0255bbd9f4e20420233b1a7
63510b1eea36a23b3520e2b39c35ef4e
0955924ebc1876f0b849b3b9e45ed49d

Run Code Online (Sandbox Code Playgroud)

它们是 …

c# performance search large-files

Com*_*k12

2021 02-01

1
推荐指数

1
解决办法

154
查看次数

标签统计

c# ×4

string ×2

.net ×1

.net-4.0 ×1

c#-4.0 ×1

large-files ×1

matching ×1

performance ×1

search ×1

text ×1

匹配大文本文件中的字符串？

在C#中处理大型文本文件

有没有办法在部分中读取大文本文件？

在 C# 中有效地在大文件中搜索字符串

标签 统计

标签统计