Difference between revisions of "Python: TwitterSearch"

From OnnoWiki
Jump to navigation Jump to search
 
(10 intermediate revisions by the same user not shown)
Line 1: Line 1:
This library allows you easily create a search through the Twitter API without having to know too much about the API details. Based on such a search you can even iterate throughout all tweets reachable via the Twitter Search API. There is an automatic reload of the next pages while using the iteration. TwitterSearch was developed as part of an interdisciplinary project at the Technische Universität München.
+
==Instalasi==
Reasons to use TwitterSearch
 
  
Well, because it can be quite annoying to always parse the search url together and a minor spelling mistake is sometimes hard to find. Not to mention the pain of getting the next page of the results. Why not centralize this process and concentrate on the more important parts of the project?
 
 
More than that, TwitterSearch is:
 
 
* pretty small (around 500 lines of code currently)
 
* pretty easy to use, even for beginners
 
* pretty good at giving you all available information (including meta information)
 
* pretty iterable without any need to manually reload more results from the API
 
* pretty wrong values of API arguments are to raise an exception. This is done before the API gets queried and therefore helps to avoid to reach Twitters’ limitations by obviously wrong API calls
 
* pretty friendly to Python >= 2.7 and Python >= 3.2
 
* pretty pretty to look at :)
 
 
==Installation==
 
 
TwitterSearch is also available on pypi and therefore can be installed via
 
  
 
  pip install TwitterSearch
 
  pip install TwitterSearch
  
or
+
atau
  
 
  easy_install TwitterSearch
 
  easy_install TwitterSearch
  
If you’d like to work with bleeding edge versions you’re free to clone the devel branch. A manual installation can be done doing by downloading or cloning the repository and running
 
  
python setup.py install.
+
==Search Twitter==
  
==Search Twitter==
+
Contoh search dengan beberapa keyword fungsi OR
  
Everybody knows how much work it is to study at a university. So why not take a small shortcut? So in this example we assume we would like to find out how to copy a doctorate thesis in Germany. Let’s have a look what the Twitter users have to say about Mr Guttenberg.
+
from TwitterSearch import *
  
from TwitterSearch import *
 
 
 
  tso = TwitterSearchOrder() # create a TwitterSearchOrder object
 
  tso = TwitterSearchOrder() # create a TwitterSearchOrder object
  tso.set_keywords(['ahok']) # let's define all words we would like to have a look for
+
keywords=("ahok","anies","sylv")
 +
  tso.set_keywords(keywords, or_operator=True)
 
  tso.set_language('id') # we want to see Indonesian tweets only
 
  tso.set_language('id') # we want to see Indonesian tweets only
 
  tso.set_include_entities(True) # and don't give us all those entity information
 
  tso.set_include_entities(True) # and don't give us all those entity information
Line 41: Line 23:
 
  # it's about time to create a TwitterSearch object with our secret tokens
 
  # it's about time to create a TwitterSearch object with our secret tokens
 
  ts = TwitterSearch(
 
  ts = TwitterSearch(
     consumer_key = '',
+
     consumer_key = ' ',
     consumer_secret = '',
+
     consumer_secret = ' ',
     access_token = '',
+
     access_token = ' ',
     access_token_secret = ''
+
     access_token_secret = ' '
 
     )
 
     )
 
   
 
   
 
  # this is where the fun actually starts :)
 
  # this is where the fun actually starts :)
 
  for tweet in ts.search_tweets_iterable(tso):
 
  for tweet in ts.search_tweets_iterable(tso):
    print( '@%s tweeted: %s' % ( tweet['user']['screen_name'], tweet['text'] ) )
+
print( '@%s tweeted: %s' % ( tweet['user']['screen_name'], tweet['text'].encode('ascii','ignore').decode('ascii') ) )
 +
 
  
 
Hasilnya kira-kira sebagai berikut,
 
Hasilnya kira-kira sebagai berikut,
  
  @Babannic2 tweeted: RT @Beritasatu: Pendeta Stephen Tong Sebut Jokowi dan Ahok Tipe Pemimpin Antikorupsi https://t.co/YJomhnXBVW
+
  @suaramasjid tweeted: Inilah Kesaksian Ketum MUI di Persidangan Ahok https://t.co/KLKBLgYu5q
  @JSJkrysx tweeted: Sekjen PDIP: Ahok-Djarot Konkret, Paslon Lain Retorika https://t.co/Vs6KKYSHiV
+
  @miraclecathy tweeted: Penyakit kanker, jantung, otak itu hrs segera ditangani. Ga ada lg ditolak RS dgn alasan penuh. Ahok peduli itu, sy https://t.co/F2jjtwOkn3
  @FirmanRenold tweeted: RT @jokoanwar: “Rakyat adalah BOS kami. Kami adalah pelayan rakyat.”
+
  @sofandresyabanu tweeted: Sidang Kemarin, Ahok Tidak Bertujuan Laporkan Saksi Maruf Amin, Tapi Mau Laporkan Habib Muchsin  https://t.co/B2UISB8xwg
 
+
  @ugisugiharto tweeted: RT @elpanjullo: https://t.co/EwE2J6YFVw Ayeeeee dukung artikel bagus ini... Bebaskan Ahok Demi Keadilan! Ayeeeee 100% yakin Ahok Kagak Sala
—Ahok https://t.co/cKkR3kOcbK
+
  @bangher_id tweeted: RT pks_cakung: Emak3 siap mengantarkan Anies Sandi menjadi gubernur DKI #AniesSandiIdamanRakyat PKSJakarta JktMaju https://t.co/jIvPcQGgSy
  @Takviri tweeted: RT @AtunSumiatun: Ahok bermain tanpa beban. https://t.co/uyRBQ3j0aS
+
  @atopbbx tweeted: Metro_TV: [Populer] Jubir Ahok: Megawati Bakal Turun di Saat yang Tepat https://t.co/d9ekkDaCF7 https://t.co/xiKzYanGZF
  @NKRIndonesia79 tweeted: Siapapun yang "melawan" Ahok pasti jadi salah? Sepertinya iya... https://t.co/gPyn8HpnkP
+
  @ssirah tweeted: RT @nay_shihab: KH Didin Hafidhuddin: Ahok Membahayakan
  @antonkerenz1 tweeted: RT @mantriss: Wah hebat pendukung Pak Ahok bicara kesopanan .... Semoga betul2 sudah berubah.... 😃😃 https://t.co/sh1DDadg5K
+
https://t.co/M65Fkx0es8
  @tom_psp tweeted: RT @Beritasatu: #PopulerB1 1: Diajak Sandi Keroyok Ahok, Sylvi Tak Paham https://t.co/0KxkEgr8NJ https://t.co/CGj6qVUHvJ
 
  
 
==Access User Timelines==
 
==Access User Timelines==
Line 73: Line 55:
 
  # it's about time to create a TwitterSearch object with our secret tokens
 
  # it's about time to create a TwitterSearch object with our secret tokens
 
  ts = TwitterSearch(
 
  ts = TwitterSearch(
     consumer_key = '',
+
     consumer_key = ' ',
     consumer_secret = '',
+
     consumer_secret = ' ',
     access_token = '',
+
     access_token = ' ',
     access_token_secret = ''
+
     access_token_secret = ' '
 
     )
 
     )
 
   
 
   
Line 90: Line 72:
 
  @onnowpurbo tweeted: openwrt dulunya utk router, cuma dengan kemampuan yang ada bisa utk macem2 :) .. https://t.co/3VZgng1RLd
 
  @onnowpurbo tweeted: openwrt dulunya utk router, cuma dengan kemampuan yang ada bisa utk macem2 :) .. https://t.co/3VZgng1RLd
 
  @onnowpurbo tweeted: bisa diskusi via email onno@indo.net.id, beraba besar komunitas pulpstone? https://t.co/GEksxU49Zy
 
  @onnowpurbo tweeted: bisa diskusi via email onno@indo.net.id, beraba besar komunitas pulpstone? https://t.co/GEksxU49Zy
 
 
==Detail dari TwitterSearch?==
 
 
Jika Anda ingin mendapatkan informasi lebih lanjut tentang bagaimana TwitterSearch bekerja secara internal dan bagaimana menggunakannya dengan semua kemungkinan itu kita lihat dokumentasi terbaru. Sebuah changelog juga tersedia dalam repositori https://github.com/ckoepp/TwitterSearch/tree/master/docs
 
Update ke 1.0.0 dan yang lebih baru
 
 
If you’re upgrading from a version < 1.0.0 be aware that the API changed! As part of the process to obtain PEP-8 compatibility all methods had to be renamed. The code changes to support the PEP-8 naming scheme are trivial. Just change the old method naming scheme from setKeywords(...) to the new one of set_keywords(...).
 
 
Apart from this issue, four other API changes were introduced with version 1.0.0:
 
 
* simplified proxy functionality (no usage of dicts but plain strings as only HTTPS proxies can be supported anyway)
 
* simplified geo-code parameter (TwitterSearchOrder.set_geocode(...,metric=True) renamed to set_geocode(...,imperial_metric=True))
 
* simplified TwitterSearch.get_statistics() from dict to tuple style ({'queries':<int>, 'tweets':<int>} to (<int>,<int>))
 
* additional feature: timelines of users can now be accessed using the new class TwitterUserOrder
 
 
In total those changes can be done quickly without browsing the documentation.
 
 
If you’re unable apply those changes, you might consider using TwitterSearch versions < 1.0.0. Those will stay available through pypi and therefore will be installable in the future using the common installation methods like pip install -I TwitterSearch==0.78.6. Using the release tags is another easy way to navigate through all versions of this library.
 
 
 
 
  
 
==Referensi==
 
==Referensi==
Line 117: Line 77:
 
* https://pypi.python.org/pypi/TwitterSearch/
 
* https://pypi.python.org/pypi/TwitterSearch/
 
* https://github.com/ckoepp/TwitterSearch/tree/master/docs
 
* https://github.com/ckoepp/TwitterSearch/tree/master/docs
 +
* https://github.com/ckoepp/TwitterSearch/blob/master/tests/test_tso.py

Latest revision as of 12:02, 2 February 2017

Instalasi

pip install TwitterSearch

atau

easy_install TwitterSearch


Search Twitter

Contoh search dengan beberapa keyword fungsi OR

from TwitterSearch import *

tso = TwitterSearchOrder() # create a TwitterSearchOrder object
keywords=("ahok","anies","sylv")
tso.set_keywords(keywords, or_operator=True)
tso.set_language('id') # we want to see Indonesian tweets only
tso.set_include_entities(True) # and don't give us all those entity information

# it's about time to create a TwitterSearch object with our secret tokens
ts = TwitterSearch(
    consumer_key = ' ',
    consumer_secret = ' ',
    access_token = ' ',
    access_token_secret = ' '
    )

# this is where the fun actually starts :)
for tweet in ts.search_tweets_iterable(tso):
	print( '@%s tweeted: %s' % ( tweet['user']['screen_name'], tweet['text'].encode('ascii','ignore').decode('ascii') ) )


Hasilnya kira-kira sebagai berikut,

@suaramasjid tweeted: Inilah Kesaksian Ketum MUI di Persidangan Ahok https://t.co/KLKBLgYu5q
@miraclecathy tweeted: Penyakit kanker, jantung, otak itu hrs segera ditangani. Ga ada lg ditolak RS dgn alasan penuh. Ahok peduli itu, sy https://t.co/F2jjtwOkn3
@sofandresyabanu tweeted: Sidang Kemarin, Ahok Tidak Bertujuan Laporkan Saksi Maruf Amin, Tapi Mau Laporkan Habib Muchsin  https://t.co/B2UISB8xwg
@ugisugiharto tweeted: RT @elpanjullo: https://t.co/EwE2J6YFVw Ayeeeee dukung artikel bagus ini... Bebaskan Ahok Demi Keadilan! Ayeeeee 100% yakin Ahok Kagak Sala
@bangher_id tweeted: RT pks_cakung: Emak3 siap mengantarkan Anies Sandi menjadi gubernur DKI #AniesSandiIdamanRakyat PKSJakarta JktMaju https://t.co/jIvPcQGgSy
@atopbbx tweeted: Metro_TV: [Populer] Jubir Ahok: Megawati Bakal Turun di Saat yang Tepat https://t.co/d9ekkDaCF7 https://t.co/xiKzYanGZF
@ssirah tweeted: RT @nay_shihab: KH Didin Hafidhuddin: Ahok Membahayakan

https://t.co/M65Fkx0es8

Access User Timelines

Alternatif lain adalah mengakses timeline seorang user saja, caranya,

from TwitterSearch import *

tuo = TwitterUserOrder('onnowpurbo') # create a TwitterUserOrder

# it's about time to create a TwitterSearch object with our secret tokens
ts = TwitterSearch(
    consumer_key = ' ',
    consumer_secret = ' ',
    access_token = ' ',
    access_token_secret = ' '
    )

# start asking Twitter about the timeline
for tweet in ts.search_tweets_iterable(tuo):
	print( '@%s tweeted: %s' % ( tweet['user']['screen_name'], tweet['text'] ) )

Hasilnya kira-kira,

@onnowpurbo tweeted: alhamdulillah :) ...  https://t.co/JRQotgThNF
@onnowpurbo tweeted: keren! ... https://t.co/xuCqpVJMAd
@onnowpurbo tweeted: sama2 lihat aktifitas di pulpstone kayanya menarik nih .. https://t.co/PN3nnf0ztW
@onnowpurbo tweeted: openwrt dulunya utk router, cuma dengan kemampuan yang ada bisa utk macem2 :) .. https://t.co/3VZgng1RLd
@onnowpurbo tweeted: bisa diskusi via email onno@indo.net.id, beraba besar komunitas pulpstone? https://t.co/GEksxU49Zy

Referensi