FileReader：使用javascript读取许多文件而不会发生内存泄漏

Question

FileReader：使用javascript读取许多文件而不会发生内存泄漏

在网页中，我必须读取文件的一小部分，对于许多（1500-12000）小文件来说，每个文件的大小约为1 Mb。收集所需信息后，将其推回到服务器上。

我的问题：我使用FileReader API，垃圾收集无法正常工作，并且内存消耗激增。

代码如下：

function extract_information_from_files(input_files) {

//some dummy implementation
for (var i = 0; i < input_files.length; ++i) {


    (function dummy_function(file) {

        var reader = new FileReader();

        reader.onload = function () {

            //convert to Uint8Array because used library expects this

            var array_buffer = new Uint8Array(reader.result);

            //do some fancy stuff with the library (very small subset of data is kept)

            //finish

            //function call ends, expect garbage collect to start cleaning.
            //even explicit dereferencing does not work
        };

        reader.readAsArrayBuffer(file);

    })(input_files[i]);

}

Run Code Online (Sandbox Code Playgroud)

}

一些说明：

不，乍一看，该库似乎未保留对已加载对象的任何引用。即使您使用array_buffer运行如上所示的代码不使用，所有内容都会保存在内存中。

行为因浏览器而异：

Chrome（43）无法清除所有内容

Firefox（38）似乎使用的剩余内存使用量约为所有文件大小的1/3

我发现很少有讨论互联网上相同问题的主题。我尝试过的是：

FileReader之后是否可以清除内存？->旧的File.prototype.mozSlice已更改为.slice，但即使如此，问题仍然存在

http://www.joelandritsch.com/posts/lessons-learned-in-javascript-11- >建议的解决方案不起作用。

https://developer.mozilla.org/zh-CN/docs/Web/JavaScript/Memory_Management对我来说不是很清楚。->似乎首先您不需要取消引用（请参见不需要对象与无法访问对象），然后它们还声明“限制：需要使对象显式不可访问”

当结合使用FileReader和https://gildas-lormeau.github.io/zip.js/时，最后一个奇怪的细节（为完整性而发布），我在将文件推送到zip存档之前读取了文件，垃圾收集才可以正常工作。

所有这些说明似乎都指向我无法使用FileReader，因此请告诉我如何使用。

Answer 1

m4k*_*tub 3

该问题可能与执行顺序有关。在您的for循环中，您正在读取所有带有reader.readAsArrayBuffer(file). onload该代码将在为读者运行任何代码之前运行。根据浏览器的实现，这可能意味着浏览器在调用FileReader任何文件之前加载整个文件（或者简单地为整个文件预分配缓冲区）。onload

尝试像队列一样处理文件，看看是否有区别。就像是：

function extract_information_from_files(input_files) {
    var reader = new FileReader();

    function process_one() {
        var single_file = input_files.pop();
        if (single_file === undefined) {
            return;
        }

        (function dummy_function(file) {
            //var reader = new FileReader();

            reader.onload = function () {
                // do your stuff
                // process next at the end
                process_one();
            };

            reader.readAsArrayBuffer(file);
        })(single_file);
    }

    process_one();
}

extract_information_from_files(file_array_1);
// uncomment next line to process another file array in parallel
// extract_information_from_files(file_array_2);

Run Code Online (Sandbox Code Playgroud)

编辑：浏览器似乎希望您重用FileReaders。我编辑了代码以重用单个阅读器，并测试（在 Chrome 中）内存使用量仍仅限于您读取的最大文件。

归档时间：	10 年，7 月前
查看次数：	2197 次
最近记录：	10 年，7 月前