
Fetching web page content with HttpClient

火星人 @ 2014-03-09, replies: 0

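1. To download the content of a remote URL you can use HttpClient. The relevant code is collected here.

It also takes care of the garbled Chinese text (mojibake) problem.

Method 1: transcode the stream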

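import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UnsupportedEncodingException;

// Read the whole stream as text, decoding the bytes as GBK.
public String convertStreamToString(InputStream is) throws UnsupportedEncodingException {
    BufferedReader reader = new BufferedReader(new InputStreamReader(is, "gbk"));
    StringBuilder sb = new StringBuilder();
    String line = null;
    try {
        // readLine() strips the line terminator, so add it back.
        while ((line = reader.readLine()) != null) {
            sb.append(line + "\n");
        }
    } catch (IOException e) {
        e.printStackTrace();
    } finally {
        try {
            is.close();
        } catch (IOException e) {
            e.printStackTrace();
        }
    }
    return sb.toString();
}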
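To see why the charset passed to InputStreamReader matters, here is a minimal sketch that decodes the same GBK bytes twice, once as GBK and once as ISO-8859-1. The class name StreamDecodeSketch and the method readAll are made up for this illustration; readAll is only a variant of convertStreamToString that takes the charset as a parameter, and it assumes the JVM ships the GBK charset (standard JDK builds do).

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class StreamDecodeSketch {

    // Hypothetical variant of convertStreamToString: the charset is a
    // parameter instead of the hard-coded "gbk".
    public static String readAll(InputStream is, Charset charset) throws IOException {
        StringBuilder sb = new StringBuilder();
        // try-with-resources closes the reader, and with it the wrapped stream.
        try (BufferedReader reader = new BufferedReader(new InputStreamReader(is, charset))) {
            String line;
            while ((line = reader.readLine()) != null) {
                sb.append(line).append('\n');
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) throws IOException {
        Charset gbk = Charset.forName("GBK");
        // Two Chinese characters, written as escapes so the source stays ASCII.
        String text = "\u4e2d\u6587";
        byte[] body = text.getBytes(gbk); // what a GBK-encoded page would send

        String right = readAll(new ByteArrayInputStream(body), gbk);
        String wrong = readAll(new ByteArrayInputStream(body), StandardCharsets.ISO_8859_1);

        System.out.println("decoded as GBK matches original: " + right.equals(text + "\n"));
        System.out.println("decoded as ISO-8859-1 matches original: " + wrong.equals(text + "\n"));
    }
}
```

Decoding with the charset the bytes were written in gives the text back; any other charset turns each byte pair into unrelated characters, which is the garbling the post works around.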

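// Download the content
// Needs Apache Commons HttpClient 3.x on the classpath,
// plus the java.io imports used by convertStreamToString.
import org.apache.commons.httpclient.HttpClient;
import org.apache.commons.httpclient.HttpException;
import org.apache.commons.httpclient.methods.GetMethod;

private String urlContent(String urlString) throws HttpException, IOException {
    HttpClient client = new HttpClient();
    // e.g. urlString = "http://www.tianya.cn/publicforum/articleslist/0/no20.shtml"
    GetMethod get = new GetMethod(urlString);
    try {
        client.executeMethod(get);
        // Print the charset reported for the response.
        System.out.print(get.getResponseCharSet());
        InputStream iStream = get.getResponseBodyAsStream();
        return convertStreamToString(iStream);
    } finally {
        // Always hand the connection back, even if reading fails.
        get.releaseConnection();
    }
}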

With the GET method above, the page content can be downloaded.
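The download method prints get.getResponseCharSet() but still decodes everything as GBK. One possible refinement, sketched here under assumptions, is to pick the charset from the Content-Type header value and fall back to a default only when the header names none. The class CharsetFromHeaderSketch and the helper charsetOf are hypothetical names, not part of HttpClient, and the parsing is deliberately simple.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.Locale;

public class CharsetFromHeaderSketch {

    // Hypothetical helper: return the charset named in a Content-Type header
    // value, or the fallback when no charset is given or this JVM does not
    // know the name.
    public static Charset charsetOf(String contentType, Charset fallback) {
        if (contentType == null) {
            return fallback;
        }
        for (String part : contentType.split(";")) {
            String p = part.trim();
            if (p.toLowerCase(Locale.ROOT).startsWith("charset=")) {
                String name = p.substring("charset=".length()).trim().replace("\"", "");
                try {
                    return Charset.forName(name);
                } catch (IllegalArgumentException e) {
                    // Covers both illegal and unsupported charset names.
                    return fallback;
                }
            }
        }
        return fallback;
    }

    public static void main(String[] args) {
        System.out.println(charsetOf("text/html; charset=UTF-8", StandardCharsets.ISO_8859_1));
        System.out.println(charsetOf("text/html", StandardCharsets.ISO_8859_1));
    }
}
```

The returned Charset could then be handed to InputStreamReader in place of the fixed "gbk" string. Pages that declare their encoding only in an HTML meta tag would still need the hard-coded fallback.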



http://coctec.com/docs/java/show-post-60846.html