Cit*_*lla 3 java twitter filter twitter4j hashtag
我正在使用Twitter4j开发应用程序.我正在尝试使用某个标签导入推文(例如:天气)然后,我想通过搜索关键字对推文进行分类.
例如:导入的一些推文可能是
- OMG, I hate this rain #weather
- This sunshine makes me feel so happy #weather
- Such strange #weather! One moment it rains, the next the sun shines. Confusing!
- Rain makes me sad #weather
- I love the sunshine! #weather
Run Code Online (Sandbox Code Playgroud)
然后,我想将这些推文归类为:
- hate, Confusing, sad,... are negative
- happy, love,... are positive
Run Code Online (Sandbox Code Playgroud)
PositiveTweets将是:
- This sunshine makes me feel so happy #weather
- I love the sunshine! #weather
Run Code Online (Sandbox Code Playgroud)
NegativeTweets将是:
- OMG, I hate this rain #weather
- Such strange #weather! One moment it rains, the next the sun shines. Confusing!
- Rain makes me sad #weather
Run Code Online (Sandbox Code Playgroud)
所以,NegativeTweets=3和PositiveTweets=2
任何人都可以帮我这个或指向类似的东西吗?
您可以查询#weather主题标签,然后将推文分成单独的列表,具体取决于它们是否包含您为好天气或恶劣天气指定的任何关键字.
public static void main(String[] args) throws TwitterException {
List<Tweet> goodWeather = new ArrayList<Tweet>();
List<Tweet> badWeather = new ArrayList<Tweet>();
Twitter twitter = new TwitterFactory().getInstance();
System.out.println("Fetching Weather Data...");
// get the 1000 most recent tweets tagged #weather
for (int page = 1; page <= 10; page++) {
Query query = new Query("#weather");
query.setRpp(100); // 100 results per page
query.setPage(page);
QueryResult qr = twitter.search(query);
List<Tweet> qrTweets = qr.getTweets();
// break out if there are no more tweets
if(qrTweets.size() == 0) break;
// separate tweets into good and bad bins
for(Tweet t : qrTweets) {
if (t.getText().toLowerCase().contains("happy") ||
t.getText().toLowerCase().contains("love")) {
goodWeather.add(t);
}
if (t.getText().toLowerCase().contains("sad") ||
t.getText().toLowerCase().contains("hate")) {
badWeather.add(t);
}
}
}
System.out.println("Good Weather: " + goodWeather.size());
for (Tweet good : goodWeather) {
System.out.println(good.getCreatedAt() + ": " + good.getText());
}
System.out.println("\nBad Weather: " + badWeather.size());
for (Tweet bad : badWeather) {
System.out.println(bad.getCreatedAt() + ": " + bad.getText());
}
}
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
4119 次 |
| 最近记录: |