Hello guys in this article i’m going to show you How to scrape words from website’s & social media’s
Requirements:
- Linux
- internet
- Firefox webdriver
How to install firefox webdriver?
https://github.com/mozilla/geckodriver/releases
sudo mv geckodriver /usr/local/bin
Yes now the geckodriver installation was completed.
How to make password list with words-scraper?
git clone https://github.com/dariusztytko/words-scraper.git
Step 2:
Now we have words-scraper folder on our directory. open that folder using this below command.
cd words-scraper
After open the folder we are going to install the requirements of words-scraper tool so type this below command.
sudo pip3 install -r requirements.txt
Step 3:
Now we need to give the permission of read & write and execute the words-scraper by using following command.
chmod +x words-scraper.py
Step 3:
After that you need to run the following order to run the words-scraper
python2 words-scraper.py
This above command will show all attribute explanation and how to use this tool.
Step 4:
Now scrape words from your target website page so run the following command.
python3 words-scraper.py -o triplezero.txt ( you target website name with protocols)
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEghdA3wnb8ls8UyMo6H69N2Q4A08UtKmW7ocuI4Pi9MzdqeaQdfQ5d8DKIaH4b_NjPOWlPW55J27LUnJIQ1LCNuRVU8n_xcbDF1npPFsXwDoTrsmcJBw0vaE8vte69H1N77ea0w9W2Beham/w512-h288/Screenshot_2020-08-18_13-00-52.png)
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh1OW9AasgK6HzmfQrXXsUFdJg4ZqwMiuvRm8GbmSN2KuwAAVBhSm7wMFUlGNsd6vRAtMBfSUs572u3Semo3UmBxVepRlO4eosR_LwU01OzdIkPmJN7KMG6kzH-VoJbHYGYzOT8cBEbRFuV/w512-h288/Screenshot_2020-08-18_13-01-09.png)
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhh7MgmnNFq1Ajjrt57HpJ86QqXMRoLfEArv1dX04bag2qXCVuezn94bSZwVQNQ2j9-PekVjvOZ9b7eNEi8OX_x51Gaqx9K81yJXhp_f5CQDISpgOgAbpBBUVdm6EySdIy3NPVcgtfqdcIz/w512-h288/Screenshot_2020-08-18_13-01-19.png)
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgpGavCKsda2CEvdH39y89KroaeV1tVbPt3pVPESQZ8aCAn2RfggkweJYdSbPQ-O3PwokKEx8nDbCZ3Rp21XC1XD0q2tHGq7ZjoUninnn6_MtkTQz8KKG3Mulsd6DV5qOCrhcficYPiHzIa/w512-h288/Screenshot_2020-08-18_13-01-39.png)
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjwIzsHk7d1agc5k_E4WliB7-QyNXaglRkdLwLtiB6ycMC-e4eGc1Dvq5pIUKVJlIdii7WxxF-nGkWM7hN46WcWVWd9GfQFFsYtY1QfWrexOljHFoQc1_aDMtgtw4S-j5pva2_U1dTj8kph/w512-h288/Screenshot_2020-08-18_13-02-15.png)
Scraping words from the target’s pages:
python3 words-scraper.py -o words.txt https://www.example.com https://blog.example.com
Such generated words list can be used to perform online brute-force attack or for cracking password hashes:
hashcat -m 0 hashes.txt words.txt
Use –depth option to scrape words from the linked pages as well. Optional –show-gui switch may be used to track the progress and make a quick view of the page:
python3 words-scraper.py -o words.txt –depth 1 –show-gui https://www.example.com
Generated words list can be expanded by using words-converter.py script. This script removes special chars and accents. An example Polish word źdźbło! will be transformed into the following words:
źdźbło!
zdzblo!
źdźbło
zdzblo
cat words.txt | python3 words-converter.py | sort -u > words2.txt
Scraping words from the target’s Twitter
Twitter page is dynamically loaded while scrolling. Use –max-scrolls option to scrape words:
python3 words-scraper.py -o words.txt –max-scrolls 300 –show-gui https://twitter.com/example.com
ssh -D 1080 -Nf {USER-HERE}@{IP-HERE} >/dev/null 2>& $ python3 words-scraper.py -o words.txt –socks-proxy 127.0.0.1:1080 https://www.example.com
إرسال تعليق