Android Java UTF-8 HttpClient问题

Mic*_*art 16 java android httpclient

我从一个网页抓取的JSON数组有奇怪的字符编码问题.服务器正在发送回此标头:

内容类型文本/ javascript; 字符集= UTF-8

此外,我可以查看Firefox或任何浏览器中的JSON输出,并正确显示Unicode字符.响应有时会包含来自其他语言的单词,带有重音符号等.但是当我把它拉下来并把它放到Java中的字符串时,我得到那些奇怪的问号.这是我的代码:

HttpParams params = new BasicHttpParams();
HttpProtocolParams.setVersion(params, HttpVersion.HTTP_1_1);
HttpProtocolParams.setContentCharset(params, "utf-8");
params.setBooleanParameter("http.protocol.expect-continue", false);

HttpClient httpclient = new DefaultHttpClient(params);

HttpGet httpget = new HttpGet("http://www.example.com/json_array.php");
HttpResponse response;
    try {
        response = httpclient.execute(httpget);

        if(response.getStatusLine().getStatusCode() == 200){
            // Connection was established. Get the content. 

            HttpEntity entity = response.getEntity();
            // If the response does not enclose an entity, there is no need
            // to worry about connection release

            if (entity != null) {
                // A Simple JSON Response Read
                InputStream instream = entity.getContent();
                String jsonText = convertStreamToString(instream);

                Toast.makeText(getApplicationContext(), "Response: "+jsonText, Toast.LENGTH_LONG).show();

            }

        }


    } catch (MalformedURLException e) {
        Toast.makeText(getApplicationContext(), "ERROR: Malformed URL - "+e.getMessage(), Toast.LENGTH_LONG).show();
        e.printStackTrace();
    } catch (IOException e) {
        Toast.makeText(getApplicationContext(), "ERROR: IO Exception - "+e.getMessage(), Toast.LENGTH_LONG).show();
        e.printStackTrace();
    } catch (JSONException e) {
        Toast.makeText(getApplicationContext(), "ERROR: JSON - "+e.getMessage(), Toast.LENGTH_LONG).show();
        e.printStackTrace();
    }

private static String convertStreamToString(InputStream is) {
    /*
     * To convert the InputStream to String we use the BufferedReader.readLine()
     * method. We iterate until the BufferedReader return null which means
     * there's no more data to read. Each line will appended to a StringBuilder
     * and returned as String.
     */
    BufferedReader reader;
    try {
        reader = new BufferedReader(new InputStreamReader(is, "UTF-8"));
    } catch (UnsupportedEncodingException e1) {
        // TODO Auto-generated catch block
        e1.printStackTrace();
    }
    StringBuilder sb = new StringBuilder();

    String line;
    try {
        while ((line = reader.readLine()) != null) {
            sb.append(line + "\n");
        }
    } catch (IOException e) {
        e.printStackTrace();
    } finally {
        try {
            is.close();
        } catch (IOException e) {
            e.printStackTrace();
        }
    }
    return sb.toString();
}
Run Code Online (Sandbox Code Playgroud)

正如您所看到的,我在InputStreamReader上指定了UTF-8,但每次我通过Toast查看返回的JSON文本时都会出现奇怪的问号.我想我需要将InputStream发送到byte []而不是?

在此先感谢您的帮助.

Vit*_*nko 38

试试这个:

if (entity != null) {
    // A Simple JSON Response Read
    // InputStream instream = entity.getContent();
    // String jsonText = convertStreamToString(instream);

    String jsonText = EntityUtils.toString(entity, HTTP.UTF_8);

    // ... toast code here
}
Run Code Online (Sandbox Code Playgroud)

  • @Michael:这个答案非常好,如果我问这个问题,我会接受这个. (3认同)

Ste*_*n C 5

@ Arhimed的答案是解决方案.但我看不出你的convertStreamToString代码有什么明显错误.

我的猜测是:

  1. 服务器在流的开头放置UTF字节顺序标记(BOM).标准的Java UTF-8字符解码器不会删除BOM,因此很可能会在结果字符串中结束.(但是,EntityUtils的代码似乎也没有对BOM做任何事情.)
  2. convertStreamToString正在一次读取一行字符串,并使用硬连线'\n'作为行尾标记重新组合它.如果要将其写入外部文件或应用程序,则应该使用特定于平台的行尾标记.