Difference between revisions of "Python: TwitterSearch"

From OnnoWiki
Jump to navigation Jump to search
 
(13 intermediate revisions by the same user not shown)
Line 1: Line 1:
This library allows you easily create a search through the Twitter API without having to know too much about the API details. Based on such a search you can even iterate throughout all tweets reachable via the Twitter Search API. There is an automatic reload of the next pages while using the iteration. TwitterSearch was developed as part of an interdisciplinary project at the Technische Universität München.
+
==Instalasi==
Reasons to use TwitterSearch
 
  
Well, because it can be quite annoying to always parse the search url together and a minor spelling mistake is sometimes hard to find. Not to mention the pain of getting the next page of the results. Why not centralize this process and concentrate on the more important parts of the project?
 
 
More than that, TwitterSearch is:
 
 
* pretty small (around 500 lines of code currently)
 
* pretty easy to use, even for beginners
 
* pretty good at giving you all available information (including meta information)
 
* pretty iterable without any need to manually reload more results from the API
 
* pretty wrong values of API arguments are to raise an exception. This is done before the API gets queried and therefore helps to avoid to reach Twitters’ limitations by obviously wrong API calls
 
* pretty friendly to Python >= 2.7 and Python >= 3.2
 
* pretty pretty to look at :)
 
 
==Installation==
 
 
TwitterSearch is also available on pypi and therefore can be installed via
 
  
 
  pip install TwitterSearch
 
  pip install TwitterSearch
  
or
+
atau
  
 
  easy_install TwitterSearch
 
  easy_install TwitterSearch
  
If you’d like to work with bleeding edge versions you’re free to clone the devel branch. A manual installation can be done doing by downloading or cloning the repository and running
 
 
python setup.py install.
 
  
 
==Search Twitter==
 
==Search Twitter==
  
Everybody knows how much work it is to study at a university. So why not take a small shortcut? So in this example we assume we would like to find out how to copy a doctorate thesis in Germany. Let’s have a look what the Twitter users have to say about Mr Guttenberg.
+
Contoh search dengan beberapa keyword fungsi OR
  
 
from TwitterSearch import *
 
from TwitterSearch import *
try:
 
    tso = TwitterSearchOrder() # create a TwitterSearchOrder object
 
    tso.set_keywords(['Guttenberg', 'Doktorarbeit']) # let's define all words we would like to have a look for
 
    tso.set_language('de') # we want to see German tweets only
 
    tso.set_include_entities(False) # and don't give us all those entity information
 
  
    # it's about time to create a TwitterSearch object with our secret tokens
+
tso = TwitterSearchOrder() # create a TwitterSearchOrder object
    ts = TwitterSearch(
+
keywords=("ahok","anies","sylv")
        consumer_key = 'aaabbb',
+
tso.set_keywords(keywords, or_operator=True)
        consumer_secret = 'cccddd',
+
tso.set_language('id') # we want to see Indonesian tweets only
        access_token = '111222',
+
tso.set_include_entities(True) # and don't give us all those entity information
        access_token_secret = '333444'
+
 +
# it's about time to create a TwitterSearch object with our secret tokens
 +
ts = TwitterSearch(
 +
    consumer_key = ' ',
 +
    consumer_secret = ' ',
 +
    access_token = ' ',
 +
    access_token_secret = ' '
 
     )
 
     )
 +
 +
# this is where the fun actually starts :)
 +
for tweet in ts.search_tweets_iterable(tso):
 +
print( '@%s tweeted: %s' % ( tweet['user']['screen_name'], tweet['text'].encode('ascii','ignore').decode('ascii') ) )
  
    # this is where the fun actually starts :)
 
    for tweet in ts.search_tweets_iterable(tso):
 
        print( '@%s tweeted: %s' % ( tweet['user']['screen_name'], tweet['text'] ) )
 
  
except TwitterSearchException as e: # take care of all those ugly errors if there are some
+
Hasilnya kira-kira sebagai berikut,
    print(e)
 
  
The result will be a text looking similar to this one. But as you see unfortunately there is no idea hidden in those tweets how to get your doctorate thesis without any work. Damn it!
+
@suaramasjid tweeted: Inilah Kesaksian Ketum MUI di Persidangan Ahok https://t.co/KLKBLgYu5q
 
+
@miraclecathy tweeted: Penyakit kanker, jantung, otak itu hrs segera ditangani. Ga ada lg ditolak RS dgn alasan penuh. Ahok peduli itu, sy https://t.co/F2jjtwOkn3
  @enricozero tweeted: RT @viehdeo: Archiv: Comedy-Video: Oliver Welke parodiert “Mogelbaron” Dr. Guttenbergs Doktorarbeit (Schummel-cum-laude Pla... http://t. ...
+
@sofandresyabanu tweeted: Sidang Kemarin, Ahok Tidak Bertujuan Laporkan Saksi Maruf Amin, Tapi Mau Laporkan Habib Muchsin  https://t.co/B2UISB8xwg
  @schlagworte tweeted: "Erst letztens habe ich in meiner Doktorarbeit Guttenberg zitiert." Blockflöte des Todes: http://t.co/pCzIn429
+
  @ugisugiharto tweeted: RT @elpanjullo: https://t.co/EwE2J6YFVw Ayeeeee dukung artikel bagus ini... Bebaskan Ahok Demi Keadilan! Ayeeeee 100% yakin Ahok Kagak Sala
  @nkoni7 tweeted: Familien sind auch betroffen wenn schlechte Politik gemacht wird. Nicht nur wenn Guttenberg seine Doktorarbeit fälscht ! #absolutemehrheit
+
@bangher_id tweeted: RT pks_cakung: Emak3 siap mengantarkan Anies Sandi menjadi gubernur DKI #AniesSandiIdamanRakyat PKSJakarta JktMaju https://t.co/jIvPcQGgSy
 +
  @atopbbx tweeted: Metro_TV: [Populer] Jubir Ahok: Megawati Bakal Turun di Saat yang Tepat https://t.co/d9ekkDaCF7 https://t.co/xiKzYanGZF
 +
  @ssirah tweeted: RT @nay_shihab: KH Didin Hafidhuddin: Ahok Membahayakan
 +
https://t.co/M65Fkx0es8
  
 
==Access User Timelines==
 
==Access User Timelines==
  
You’re thinking that the global wisdom of Twitter is way too much for your needs? Well, let’s query a timeline of a certain user than:
+
Alternatif lain adalah mengakses timeline seorang user saja, caranya,
 
 
from TwitterSearch import *
 
 
 
try:
 
    tuo = TwitterUserOrder('NeinQuarterly') # create a TwitterUserOrder
 
 
 
    # it's about time to create TwitterSearch object again
 
    ts = TwitterSearch(
 
        consumer_key = 'aaabbb',
 
        consumer_secret = 'cccddd',
 
        access_token = '111222',
 
        access_token_secret = '333444'
 
    )
 
 
 
    # start asking Twitter about the timeline
 
    for tweet in ts.search_tweets_iterable(tuo):
 
        print( '@%s tweeted: %s' % ( tweet['user']['screen_name'], tweet['text'] ) )
 
 
 
except TwitterSearchException as e: # catch all those ugly errors
 
    print(e)
 
 
 
You may guess the resulting output, but here it is anyway:
 
 
 
@NeinQuarterly tweeted: To make a long story short: Twitter.
 
@NeinQuarterly tweeted: A German subordinating conjunction walks into a bar. Three hours later it's joined by a verb.
 
@NeinQuarterly tweeted: Foucault walks into a bar. No one notices.
 
@NeinQuarterly tweeted: If it's not deleted, probably wasn't worth writing.
 
@NeinQuarterly tweeted: Trust me: German prepositions aren't laughing with you. They're laughing at you.
 
@NeinQuarterly tweeted: Another beautiful day for cultural pessimism.
 
@NeinQuarterly tweeted: Excuse me, sir. Your Zeitgeist has arrived.
 
 
 
==Interested in some more details?==
 
 
 
If you’d like to get more information about how TwitterSearch works internally and how to use it with all it’s possibilities have a look at the latest documentation. A changelog is also available within this repository.
 
Updating to 1.0.0 and newer
 
 
 
If you’re upgrading from a version < 1.0.0 be aware that the API changed! As part of the process to obtain PEP-8 compatibility all methods had to be renamed. The code changes to support the PEP-8 naming scheme are trivial. Just change the old method naming scheme from setKeywords(...) to the new one of set_keywords(...).
 
 
 
Apart from this issue, four other API changes were introduced with version 1.0.0:
 
 
 
* simplified proxy functionality (no usage of dicts but plain strings as only HTTPS proxies can be supported anyway)
 
* simplified geo-code parameter (TwitterSearchOrder.set_geocode(...,metric=True) renamed to set_geocode(...,imperial_metric=True))
 
* simplified TwitterSearch.get_statistics() from dict to tuple style ({'queries':<int>, 'tweets':<int>} to (<int>,<int>))
 
* additional feature: timelines of users can now be accessed using the new class TwitterUserOrder
 
 
 
In total those changes can be done quickly without browsing the documentation.
 
 
 
If you’re unable apply those changes, you might consider using TwitterSearch versions < 1.0.0. Those will stay available through pypi and therefore will be installable in the future using the common installation methods like pip install -I TwitterSearch==0.78.6. Using the release tags is another easy way to navigate through all versions of this library.
 
  
 +
from TwitterSearch import *
 +
 +
tuo = TwitterUserOrder('onnowpurbo') # create a TwitterUserOrder
 +
 +
# it's about time to create a TwitterSearch object with our secret tokens
 +
ts = TwitterSearch(
 +
    consumer_key = ' ',
 +
    consumer_secret = ' ',
 +
    access_token = ' ',
 +
    access_token_secret = ' '
 +
    )
 +
 +
# start asking Twitter about the timeline
 +
for tweet in ts.search_tweets_iterable(tuo):
 +
print( '@%s tweeted: %s' % ( tweet['user']['screen_name'], tweet['text'] ) )
  
 +
Hasilnya kira-kira,
  
 +
@onnowpurbo tweeted: alhamdulillah :) ...  https://t.co/JRQotgThNF
 +
@onnowpurbo tweeted: keren! ... https://t.co/xuCqpVJMAd
 +
@onnowpurbo tweeted: sama2 lihat aktifitas di pulpstone kayanya menarik nih .. https://t.co/PN3nnf0ztW
 +
@onnowpurbo tweeted: openwrt dulunya utk router, cuma dengan kemampuan yang ada bisa utk macem2 :) .. https://t.co/3VZgng1RLd
 +
@onnowpurbo tweeted: bisa diskusi via email onno@indo.net.id, beraba besar komunitas pulpstone? https://t.co/GEksxU49Zy
  
 
==Referensi==
 
==Referensi==
  
 
* https://pypi.python.org/pypi/TwitterSearch/
 
* https://pypi.python.org/pypi/TwitterSearch/
 +
* https://github.com/ckoepp/TwitterSearch/tree/master/docs
 +
* https://github.com/ckoepp/TwitterSearch/blob/master/tests/test_tso.py

Latest revision as of 12:02, 2 February 2017

Instalasi

pip install TwitterSearch

atau

easy_install TwitterSearch


Search Twitter

Contoh search dengan beberapa keyword fungsi OR

from TwitterSearch import *

tso = TwitterSearchOrder() # create a TwitterSearchOrder object
keywords=("ahok","anies","sylv")
tso.set_keywords(keywords, or_operator=True)
tso.set_language('id') # we want to see Indonesian tweets only
tso.set_include_entities(True) # and don't give us all those entity information

# it's about time to create a TwitterSearch object with our secret tokens
ts = TwitterSearch(
    consumer_key = ' ',
    consumer_secret = ' ',
    access_token = ' ',
    access_token_secret = ' '
    )

# this is where the fun actually starts :)
for tweet in ts.search_tweets_iterable(tso):
	print( '@%s tweeted: %s' % ( tweet['user']['screen_name'], tweet['text'].encode('ascii','ignore').decode('ascii') ) )


Hasilnya kira-kira sebagai berikut,

@suaramasjid tweeted: Inilah Kesaksian Ketum MUI di Persidangan Ahok https://t.co/KLKBLgYu5q
@miraclecathy tweeted: Penyakit kanker, jantung, otak itu hrs segera ditangani. Ga ada lg ditolak RS dgn alasan penuh. Ahok peduli itu, sy https://t.co/F2jjtwOkn3
@sofandresyabanu tweeted: Sidang Kemarin, Ahok Tidak Bertujuan Laporkan Saksi Maruf Amin, Tapi Mau Laporkan Habib Muchsin  https://t.co/B2UISB8xwg
@ugisugiharto tweeted: RT @elpanjullo: https://t.co/EwE2J6YFVw Ayeeeee dukung artikel bagus ini... Bebaskan Ahok Demi Keadilan! Ayeeeee 100% yakin Ahok Kagak Sala
@bangher_id tweeted: RT pks_cakung: Emak3 siap mengantarkan Anies Sandi menjadi gubernur DKI #AniesSandiIdamanRakyat PKSJakarta JktMaju https://t.co/jIvPcQGgSy
@atopbbx tweeted: Metro_TV: [Populer] Jubir Ahok: Megawati Bakal Turun di Saat yang Tepat https://t.co/d9ekkDaCF7 https://t.co/xiKzYanGZF
@ssirah tweeted: RT @nay_shihab: KH Didin Hafidhuddin: Ahok Membahayakan

https://t.co/M65Fkx0es8

Access User Timelines

Alternatif lain adalah mengakses timeline seorang user saja, caranya,

from TwitterSearch import *

tuo = TwitterUserOrder('onnowpurbo') # create a TwitterUserOrder

# it's about time to create a TwitterSearch object with our secret tokens
ts = TwitterSearch(
    consumer_key = ' ',
    consumer_secret = ' ',
    access_token = ' ',
    access_token_secret = ' '
    )

# start asking Twitter about the timeline
for tweet in ts.search_tweets_iterable(tuo):
	print( '@%s tweeted: %s' % ( tweet['user']['screen_name'], tweet['text'] ) )

Hasilnya kira-kira,

@onnowpurbo tweeted: alhamdulillah :) ...  https://t.co/JRQotgThNF
@onnowpurbo tweeted: keren! ... https://t.co/xuCqpVJMAd
@onnowpurbo tweeted: sama2 lihat aktifitas di pulpstone kayanya menarik nih .. https://t.co/PN3nnf0ztW
@onnowpurbo tweeted: openwrt dulunya utk router, cuma dengan kemampuan yang ada bisa utk macem2 :) .. https://t.co/3VZgng1RLd
@onnowpurbo tweeted: bisa diskusi via email onno@indo.net.id, beraba besar komunitas pulpstone? https://t.co/GEksxU49Zy

Referensi