A powerful Python script that allows you to scrape messages and media from Telegram channels using the Telethon library. Features include real-time continuous scraping, media downloading, and data export capabilities.
___________________ _________
\__ ___/ _____/ / _____/
| | / \ ___ \_____ \
| | \ \_\ \/ \
|____| \______ /_______ /
\/ \/
Before running the script, you'll need:
pip install -r requirements.txt
Contents of requirements.txt
:
telethon
aiohttp
asyncio
api_id
: A numberapi_hash
: A string of letters and numbersKeep these credentials safe, you'll need them to run the script!
git clone https://github.com/unnohwn/telegram-scraper.git
cd telegram-scraper
pip install -r requirements.txt
python telegram-scraper.py
When scraping a channel for the first time, please note:
The script provides an interactive menu with the following options:
You can use either: - Channel username (e.g., channelname
) - Channel ID (e.g., -1001234567890
)
Data is stored in SQLite databases, one per channel: - Location: ./channelname/channelname.db
- Table: messages
- id
: Primary key - message_id
: Telegram message ID - date
: Message timestamp - sender_id
: Sender's Telegram ID - first_name
: Sender's first name - last_name
: Sender's last name - username
: Sender's username - message
: Message text - media_type
: Type of media (if any) - media_path
: Local path to downloaded media - reply_to
: ID of replied message (if any)
Media files are stored in: - Location: ./channelname/media/
- Files are named using message ID or original filename
Data can be exported in two formats: 1. CSV: ./channelname/channelname.csv
- Human-readable spreadsheet format - Easy to import into Excel/Google Sheets
./channelname/channelname.json
The continuous scraping feature ([C]
option) allows you to: - Monitor channels in real-time - Automatically download new messages - Download media as it's posted - Run indefinitely until interrupted (Ctrl+C) - Maintains state between runs
The script can download: - Photos - Documents - Other media types supported by Telegram - Automatically retries failed downloads - Skips existing files to avoid duplicates
The script includes: - Automatic retry mechanism for failed media downloads - State preservation in case of interruption - Flood control compliance - Error logging for failed operations
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.
This tool is for educational purposes only. Make sure to: - Respect Telegram's Terms of Service - Obtain necessary permissions before scraping - Use responsibly and ethically - Comply with data protection regulations
A Python script that allows you to automatically scrape and download stories from your Telegram friends using the Telethon library. The script continuously monitors and saves both photos and videos from stories, along with their metadata.
Due to Telegram API restrictions, this script can only access stories from: - Users you have added to your friend list - Users whose privacy settings allow you to view their stories
This is a limitation of Telegram's API and cannot be bypassed.
Before running the script, you'll need:
pip install -r requirements.txt
Contents of requirements.txt
:
telethon
openpyxl
schedule
api_id
: A numberapi_hash
: A string of letters and numbersKeep these credentials safe, you'll need them to run the script!
git clone https://github.com/unnohwn/telegram-story-scraper.git
cd telegram-story-scraper
pip install -r requirements.txt
python TGSS.py
The script: 1. Connects to your Telegram account 2. Periodically checks for new stories from your friends 3. Downloads any new stories (photos/videos) 4. Stores metadata in a SQLite database 5. Exports information to an Excel file 6. Runs continuously until interrupted (Ctrl+C)
SQLite database containing: - user_id
: Telegram user ID of the story creator - story_id
: Unique story identifier - timestamp
: When the story was posted (UTC+2) - filename
: Local filename of the downloaded media
Export file containing the same information as the database, useful for: - Easy viewing of story metadata - Filtering and sorting - Data analysis - Sharing data with others
{user_id}_{story_id}.jpg
{user_id}_{story_id}.{extension}
The script includes: - Automatic retry mechanism for failed downloads - Error logging for failed operations - Connection error handling - State preservation in case of interruption
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.
This tool is for educational purposes only. Make sure to: - Respect Telegram's Terms of Service - Obtain necessary permissions before scraping - Use responsibly and ethically - Comply with data protection regulations - Respect user privacy
Introducing Uscrapper 2.0, A powerfull OSINT webscrapper that allows users to extract various personal information from a website. It leverages web scraping techniques and regular expressions to extract email addresses, social media links, author names, geolocations, phone numbers, and usernames from both hyperlinked and non-hyperlinked sources on the webpage, supports multithreading to make this process faster, Uscrapper 2.0 is equipped with advanced Anti-webscrapping bypassing modules and supports webcrawling to scrape from various sublinks within the same domain. The tool also provides an option to generate a report containing the extracted details.
Uscrapper extracts the following details from the provided website:
Uscrapper 2.0:
git clone https://github.com/z0m31en7/Uscrapper.git
cd Uscrapper/install/
chmod +x ./install.sh && ./install.sh #For Unix/Linux systems
To run Uscrapper, use the following command-line syntax:
python Uscrapper-v2.0.py [-h] [-u URL] [-c (INT)] [-t THREADS] [-O] [-ns]
Arguments:
Uscrapper relies on web scraping techniques to extract information from websites. Make sure to use it responsibly and in compliance with the website's terms of service and applicable laws.
The accuracy and completeness of the extracted details depend on the structure and content of the website being analyzed.
To bypass some Anti-Webscrapping methods we have used selenium which can make the overall process slower.
Efficiently finding registered accounts from emails.
Holehe checks if an email is attached to an account on sites like twitter, instagram, imgur and more than 120 others.
pip3 install holehe
git clone https://github.com/megadose/holehe.git
cd holehe/
python3 setup.py install
Holehe can be run from the CLI and rapidly embedded within existing python applications.
holehe test@gmail.com
import trio
import httpx
from holehe.modules.social_media.snapchat import snapchat
async def main():
email = "test@gmail.com"
out = []
client = httpx.AsyncClient()
await snapchat(email, client, out)
print(out)
await client.aclose()
trio.run(main)
For each module, data is returned in a standard dictionary with the following json-equivalent format :
{
"name": "example",
"rateLimit": false,
"exists": true,
"emailrecovery": "ex****e@gmail.com",
"phoneNumber": "0*******78",
"others": null
}
Rate limit? Change your IP.
For BTC Donations : 1FHDM49QfZX6pJmhjLE5tB2K6CaTLMZpXZ
GNU General Public License v3.0
Built for educational purposes only.
Name | Domain | Method | Frequent Rate Limit |
---|---|---|---|
aboutme | about.me | register | โ |
adobe | adobe.com | password recovery | โ |
amazon | amazon.com | login | โ |
amocrm | amocrm.com | register | โ |
anydo | any.do | login | โ |
archive | archive.org | register | โ |
armurerieauxerre | armurerie-auxerre.com | register | โ |
atlassian | atlassian.com | register | โ |
axonaut | axonaut.com | register | โ |
babeshows | babeshows.co.uk | register | โ |
badeggsonline | badeggsonline.com | register | โ |
biosmods | bios-mods.com | register | โ |
biotechnologyforums | biotechnologyforums.com | register | โ |
bitmoji | bitmoji.com | login | โ |
blablacar | blablacar.com | register | โ |
blackworldforum | blackworldforum.com | register | โ |
blip | blip.fm | register | โ |
blitzortung | forum.blitzortung.org | register | โ |
bluegrassrivals | bluegrassrivals.com | register | โ |
bodybuilding | bodybuilding.com | register | โ |
buymeacoffee | buymeacoffee.com | register | โ |
cambridgemt | discussion.cambridge-mt.com | register | โ |
caringbridge | caringbridge.org | register | โ |
chinaphonearena | chinaphonearena.com | register | โ |
clashfarmer | clashfarmer.com | register | โ |
codecademy | codecademy.com | register | โ |
codeigniter | forum.codeigniter.com | register | โ |
codepen | codepen.io | register | โ |
coroflot | coroflot.com | register | โ |
cpaelites | cpaelites.com | register | โ |
cpahero | cpahero.com | register | โ |
cracked_to | cracked.to | register | โ |
crevado | crevado.com | register | โ |
deliveroo | deliveroo.com | register | โ |
demonforums | demonforums.net | register | โ |
devrant | devrant.com | register | โ |
diigo | diigo.com | register | โ |
discord | discord.com | register | โ |
docker | docker.com | register | โ |
dominosfr | dominos.fr | register | โ |
ebay | ebay.com | login | โ |
ello | ello.co | register | โ |
envato | envato.com | register | โ |
eventbrite | eventbrite.com | login | โ |
evernote | evernote.com | login | โ |
fanpop | fanpop.com | register | โ |
firefox | firefox.com | register | โ |
flickr | flickr.com | login | โ |
freelancer | freelancer.com | register | โ |
freiberg | drachenhort.user.stunet.tu-freiberg.de | register | โ |
garmin | garmin.com | register | โ |
github | github.com | register | โ |
google.com | register | โ | |
gravatar | gravatar.com | other | โ |
hubspot | hubspot.com | login | โ |
imgur | imgur.com | register | โ |
insightly | insightly.com | login | โ |
instagram.com | register | โ | |
issuu | issuu.com | register | โ |
koditv | forum.kodi.tv | register | โ |
komoot | komoot.com | register | โ |
laposte | laposte.fr | register | โ |
lastfm | last.fm | register | โ |
lastpass | lastpass.com | register | โ |
mail_ru | mail.ru | password recovery | โ |
mybb | community.mybb.com | register | โ |
myspace | myspace.com | register | โ |
nattyornot | nattyornotforum.nattyornot.com | register | โ |
naturabuy | naturabuy.fr | register | โ |
ndemiccreations | forum.ndemiccreations.com | register | โ |
nextpvr | forums.nextpvr.com | register | โ |
nike | nike.com | register | โ |
nimble | nimble.com | register | โ |
nocrm | nocrm.io | register | โ |
nutshell | nutshell.com | register | โ |
odnoklassniki | ok.ru | password recovery | โ |
office365 | office365.com | other | โ |
onlinesequencer | onlinesequencer.net | register | โ |
parler | parler.com | login | โ |
patreon | patreon.com | login | โ |
pinterest.com | register | โ | |
pipedrive | pipedrive.com | register | โ |
plurk | plurk.com | register | โ |
pornhub | pornhub.com | register | โ |
protonmail | protonmail.ch | other | โ |
quora | quora.com | register | โ |
rambler | rambler.ru | register | โ |
redtube | redtube.com | register | โ |
replit | replit.com | register | โ |
rocketreach | rocketreach.co | register | โ |
samsung | samsung.com | register | โ |
seoclerks | seoclerks.com | register | โ |
sevencups | 7cups.com | register | โ |
smule | smule.com | register | โ |
snapchat | snapchat.com | login | โ |
soundcloud | soundcloud.com | register | โ |
sporcle | sporcle.com | register | โ |
spotify | spotify.com | register | โ |
strava | strava.com | register | โ |
taringa | taringa.net | register | โ |
teamleader | teamleader.com | register | โ |
teamtreehouse | teamtreehouse.com | register | โ |
tellonym | tellonym.me | register | โ |
thecardboard | thecardboard.org | register | โ |
therianguide | forums.therian-guide.com | register | โ |
thevapingforum | thevapingforum.com | register | โ |
tumblr | tumblr.com | register | โ |
tunefind | tunefind.com | register | โ |
twitter.com | register | โ | |
venmo | venmo.com | register | โ |
vivino | vivino.com | register | โ |
voxmedia | voxmedia.com | register | โ |
vrbo | vrbo.com | register | โ |
vsco | vsco.co | register | โ |
wattpad | wattpad.com | register | โ |
wordpress | wordpress | login | โ |
xing.com | register | โ | |
xnxx | xnxx.com | register | โ |
xvideos | xvideos.com | register | โ |
yahoo | yahoo.com | login | โ |
zoho | zoho.com | login | โ |