Problem with the encoding of content from HttpWebRequest

ama*_*eur 2 c# asp.net webrequest httpwebrequest httpwebresponse

I am reading a web page with HttpWebRequest using the following code:

    var pageurl = new Uri(url, UriKind.Absolute);

    var request = (HttpWebRequest)WebRequest.Create(pageurl);
    request.Method = "GET";
    request.AutomaticDecompression = DecompressionMethods.GZip;
    request.KeepAlive = false;
    request.ConnectionGroupName = Guid.NewGuid().ToString();
    request.ServicePoint.Expect100Continue = false;
    request.Pipelined = false;
    request.MaximumResponseHeadersLength = 4;

    if (ignoreCertificateErrors)
    {
        ServicePointManager.ServerCertificateValidationCallback += AcceptAllCertificatePolicy;
    }

    var response = (HttpWebResponse)request.GetResponse();

    if (response != null)
    {
        using (var reader = new StreamReader(response.GetResponseStream()))
        {
            return reader.ReadToEnd();
        }
    }

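This works perfectly when the page is in English, but for other languages such as Spanish I get a lot of garbled characters (for example �) in the returned content.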

Is there a problem with the code, or is there an encoding step I am missing?

svi*_*ick 5

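You have to give the StreamReader the correct encoding for the page you are downloading. Without one, StreamReader assumes UTF-8, so bytes from a page in a different encoding are decoded incorrectly. For example, if the page is encoded in ISO-8859-2, use: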

new StreamReader(response.GetResponseStream(), Encoding.GetEncoding("ISO-8859-2"))
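If the encoding is not known in advance, one option is to take it from the response's Content-Type header, which HttpWebResponse exposes as CharacterSet. The following is only a rough sketch under that assumption: ResponseDecoding, ResolveEncoding, ReadAll and ReadBody are hypothetical names, not part of the original code, and the UTF-8 fallback is an arbitrary choice. The header can also be missing or wrong, in which case the encoding still has to be supplied by hand as shown above.

```csharp
using System;
using System.IO;
using System.Net;
using System.Text;

// Hypothetical helper class, not part of the original question's code.
static class ResponseDecoding
{
    // Map a charset name (for example the value of HttpWebResponse.CharacterSet)
    // to an Encoding. Returns the fallback when the name is empty or unknown.
    public static Encoding ResolveEncoding(string charset, Encoding fallback)
    {
        if (string.IsNullOrWhiteSpace(charset))
        {
            return fallback;
        }

        try
        {
            // Some servers quote the charset value, so strip quotes first.
            return Encoding.GetEncoding(charset.Trim().Trim('"'));
        }
        catch (Exception ex) when (ex is ArgumentException || ex is NotSupportedException)
        {
            return fallback;
        }
    }

    // Read the whole stream as text using the given encoding.
    public static string ReadAll(Stream stream, Encoding encoding)
    {
        using (var reader = new StreamReader(stream, encoding))
        {
            return reader.ReadToEnd();
        }
    }

    // Decode the response body using the charset announced by the server,
    // falling back to UTF-8 (an assumption) when none is usable.
    public static string ReadBody(HttpWebResponse response)
    {
        var encoding = ResolveEncoding(response.CharacterSet, Encoding.UTF8);
        return ReadAll(response.GetResponseStream(), encoding);
    }
}
```

In the question's code, the using block around StreamReader could then be replaced with `return ResponseDecoding.ReadBody(response);`. On .NET Core and later, encodings such as ISO-8859-2 are not available by default and may first need `Encoding.RegisterProvider(CodePagesEncodingProvider.Instance)`.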