如何登录网站?

Yat*_*oel 2 java authentication login httpclient

我想通过java程序登录ORKUT.我正在使用以下程序来完成它.我从一些网站上复制了它.现在我想将它用于ORKUT.但我对某些问题有一些疑问.

Q1.在哪里提供登录页面的URL(我想在新的HTTPGET(".....")中)?我是对还是不对?

Q2.传递给HTTPPost("")的构造函数的参数.如果我们必须在登录网页的html源代码中传递"form"元素的"action"属性的值,那么请确认它.

Q3.ORKUT登录页面的"form"元素具有该属性

onsubmit="return(gaia_onLoginSubmit());"
Run Code Online (Sandbox Code Playgroud)

由于存在上述属性,我是否需要对以下代码进行任何更改?

Q4.如何在登录后获取html网页源代码?

import java.util.ArrayList;
import java.util.List;
import org.apache.http.HttpEntity;
import org.apache.http.HttpResponse;
import org.apache.http.NameValuePair;
import org.apache.http.client.entity.UrlEncodedFormEntity;
import org.apache.http.client.methods.HttpGet;
import org.apache.http.client.methods.HttpPost;
import org.apache.http.cookie.Cookie;
import org.apache.http.impl.client.DefaultHttpClient;
import org.apache.http.message.BasicNameValuePair;
import org.apache.http.protocol.HTTP;


public class ClientFormLogin {

public static void main(String[] args) throws Exception {

    DefaultHttpClient httpclient = new DefaultHttpClient();

    HttpGet httpget = new HttpGet("https://www.google.com/accounts/ServiceLogin?service=orkut&hl=en-US&rm=false&continue=http%3A%2F%2Fwww.orkut.com%2FRedirLogin%3Fmsg%3D0%26page%3Dhttp%253A%252F%252Fwww.orkut.co.in%252FHome.aspx&cd=IN&passive=true&skipvpage=true&sendvemail=false");

    HttpResponse response = httpclient.execute(httpget);
    HttpEntity entity = response.getEntity();

    System.out.println("Login form get: " + response.getStatusLine());
    if (entity != null) {
        entity.consumeContent();
    }
    System.out.println("Initial set of cookies:");
    List<Cookie> cookies = httpclient.getCookieStore().getCookies();
    if (cookies.isEmpty()) {
        System.out.println("None");
    } else {
        for (int i = 0; i < cookies.size(); i++) {
            System.out.println("- " + cookies.get(i).toString());
        }
    }

    HttpPost httpost = new HttpPost("https://www.google.com/accounts/ServiceLoginAuth?service=orkut");

    List <NameValuePair> nvps = new ArrayList <NameValuePair>();
    nvps.add(new BasicNameValuePair("Email", "username"));
    nvps.add(new BasicNameValuePair("Passwd", "password"));

    httpost.setEntity(new UrlEncodedFormEntity(nvps, HTTP.UTF_8));

    response = httpclient.execute(httpost);
    entity = response.getEntity();

    System.out.println("Login form get: " + response.getStatusLine());
    if (entity != null) {
        entity.consumeContent();
    }

    System.out.println("Post logon cookies:");
    cookies = httpclient.getCookieStore().getCookies();
    if (cookies.isEmpty()) {
        System.out.println("None");
    } else {
        for (int i = 0; i < cookies.size(); i++) {
            System.out.println("- " + cookies.get(i).toString());
        }
    }

    // When HttpClient instance is no longer needed, 
    // shut down the connection manager to ensure
    // immediate deallocation of all system resources
    httpclient.getConnectionManager().shutdown();        
Run Code Online (Sandbox Code Playgroud)

Bob*_*Gee 5

Q1:

执行此操作的标准方法是登录URL的HTTP POST,其中登录信息作为方法体中的参数.这通常是用户名和密码(或者可能是密码的哈希值).

会话cookie可以从响应头(或其cookie)中检索,然后作为属性添加到站点的未来HTTP GET或作为请求头.

Q2:

我认为这取决于网站.不确定 - 尝试修补Firefox和Live HTTP Headers扩展.

Q3:

可能不是.

Q4:

在HTTP GET之后使用Method.getResponseBodyAsString或Method.getResponseBody或Method.getResponseBodyAsStream来检索响应,该响应将包含页面的HTML源.