Telegram-Scraper - A Powerful Python Script That Allows You To Scrape Messages And Media From Telegram Channels Using The Telethon Library

By: Unknown

A powerful Python script that allows you to scrape messages and media from Telegram channels using the Telethon library. Features include real-time continuous scraping, media downloading, and data export capabilities.

___________________  _________
\__    ___/  _____/ /   _____/
  |    | /   \  ___ \_____  \ 
  |    | \    \_\  \/        \
  |____|  \______  /_______  /
                 \/        \/

Features 🚀

Scrape messages from multiple Telegram channels
Download media files (photos, documents)
Real-time continuous scraping
Export data to JSON and CSV formats
SQLite database storage
Resume capability (saves progress)
Media reprocessing for failed downloads
Progress tracking
Interactive menu interface

Prerequisites 📋

Before running the script, you'll need:

Python 3.7 or higher
Telegram account
API credentials from Telegram

Required Python packages

pip install -r requirements.txt

Contents of requirements.txt:

telethon
aiohttp
asyncio

Getting Telegram API Credentials 🔑

Visit https://my.telegram.org/auth
Log in with your phone number
Click on "API development tools"
Fill in the form:
App title: Your app name
Short name: Your app short name
Platform: Can be left as "Desktop"
Description: Brief description of your app
Click "Create application"
You'll receive:
api_id: A number
api_hash: A string of letters and numbers

Keep these credentials safe, you'll need them to run the script!

Setup and Running 🔧

Clone the repository:

git clone https://github.com/unnohwn/telegram-scraper.git
cd telegram-scraper

Install requirements:

pip install -r requirements.txt

Run the script:

python telegram-scraper.py

On first run, you'll be prompted to enter:
Your API ID
Your API Hash
Your phone number (with country code)
Your phone number (with country code) or bot, but use the phone number option when prompted second time.
Verification code (sent to your Telegram)

Initial Scraping Behavior 🕒

When scraping a channel for the first time, please note:

The script will attempt to retrieve the entire channel history, starting from the oldest messages
Initial scraping can take several minutes or even hours, depending on:
The total number of messages in the channel
Whether media downloading is enabled
The size and number of media files
Your internet connection speed
Telegram's rate limiting
The script uses pagination and maintains state, so if interrupted, it can resume from where it left off
Progress percentage is displayed in real-time to track the scraping status
Messages are stored in the database as they are scraped, so you can start analyzing available data even before the scraping is complete

Usage 📝

The script provides an interactive menu with the following options:

[A] Add new channel
Enter the channel ID or channelname
[R] Remove channel
Remove a channel from scraping list
[S] Scrape all channels
One-time scraping of all configured channels
[M] Toggle media scraping
Enable/disable downloading of media files
[C] Continuous scraping
Real-time monitoring of channels for new messages
[E] Export data
Export to JSON and CSV formats
[V] View saved channels
List all saved channels
[L] List account channels
List all channels with ID:s for account
[Q] Quit

Channel IDs 📢

You can use either: - Channel username (e.g., channelname) - Channel ID (e.g., -1001234567890)

Data Storage 💾

Database Structure

Data is stored in SQLite databases, one per channel: - Location: ./channelname/channelname.db - Table: messages - id: Primary key - message_id: Telegram message ID - date: Message timestamp - sender_id: Sender's Telegram ID - first_name: Sender's first name - last_name: Sender's last name - username: Sender's username - message: Message text - media_type: Type of media (if any) - media_path: Local path to downloaded media - reply_to: ID of replied message (if any)

Media Storage 📁

Media files are stored in: - Location: ./channelname/media/ - Files are named using message ID or original filename

Exported Data 📊

Data can be exported in two formats: 1. CSV: ./channelname/channelname.csv - Human-readable spreadsheet format - Easy to import into Excel/Google Sheets

JSON: ./channelname/channelname.json
Structured data format
Ideal for programmatic processing

Features in Detail 🔍

Continuous Scraping

The continuous scraping feature ([C] option) allows you to: - Monitor channels in real-time - Automatically download new messages - Download media as it's posted - Run indefinitely until interrupted (Ctrl+C) - Maintains state between runs

Media Handling

The script can download: - Photos - Documents - Other media types supported by Telegram - Automatically retries failed downloads - Skips existing files to avoid duplicates

Error Handling 🛠️

The script includes: - Automatic retry mechanism for failed media downloads - State preservation in case of interruption - Flood control compliance - Error logging for failed operations

Limitations ⚠️

Respects Telegram's rate limits
Can only access public channels or channels you're a member of
Media download size limits apply as per Telegram's restrictions

Contributing 🤝

Contributions are welcome! Please feel free to submit a Pull Request.

License 📄

This project is licensed under the MIT License - see the LICENSE file for details.

Disclaimer ⚖️

This tool is for educational purposes only. Make sure to: - Respect Telegram's Terms of Service - Obtain necessary permissions before scraping - Use responsibly and ethically - Comply with data protection regulations

Download Telegram-Scraper

Telegram-Story-Scraper - A Python Script That Allows You To Automatically Scrape And Download Stories From Your Telegram Friends

By: Unknown

A Python script that allows you to automatically scrape and download stories from your Telegram friends using the Telethon library. The script continuously monitors and saves both photos and videos from stories, along with their metadata.

Important Note About Story Access ⚠️

Due to Telegram API restrictions, this script can only access stories from: - Users you have added to your friend list - Users whose privacy settings allow you to view their stories

This is a limitation of Telegram's API and cannot be bypassed.

Features 🚀

Automatically scrapes all available stories from your Telegram friends
Downloads both photos and videos from stories
Stores metadata in SQLite database
Exports data to Excel spreadsheet
Real-time monitoring with customizable intervals
Timestamp is set to (UTC+2)
Maintains record of previously downloaded stories
Resume capability
Automatic retry mechanism

Prerequisites 📋

Before running the script, you'll need:

Python 3.7 or higher
Telegram account
API credentials from Telegram
Friends on Telegram whose stories you want to track

Required Python packages

pip install -r requirements.txt

Contents of requirements.txt:

telethon
openpyxl
schedule

Getting Telegram API Credentials 🔑

Visit https://my.telegram.org/auth
Log in with your phone number
Click on "API development tools"
Fill in the form:
App title: Your app name
Short name: Your app short name
Platform: Can be left as "Desktop"
Description: Brief description of your app
Click "Create application"
You'll receive:
api_id: A number
api_hash: A string of letters and numbers

Keep these credentials safe, you'll need them to run the script!

Setup and Running 🔧

Clone the repository:

git clone https://github.com/unnohwn/telegram-story-scraper.git
cd telegram-story-scraper

Install requirements:

pip install -r requirements.txt

Run the script:

python TGSS.py

On first run, you'll be prompted to enter:
Your API ID
Your API Hash
Your phone number (with country code)
Verification code (sent to your Telegram)
Checking interval in seconds (default is 60)

How It Works 🔄

The script: 1. Connects to your Telegram account 2. Periodically checks for new stories from your friends 3. Downloads any new stories (photos/videos) 4. Stores metadata in a SQLite database 5. Exports information to an Excel file 6. Runs continuously until interrupted (Ctrl+C)

Data Storage 💾

Database Structure (stories.db)

SQLite database containing: - user_id: Telegram user ID of the story creator - story_id: Unique story identifier - timestamp: When the story was posted (UTC+2) - filename: Local filename of the downloaded media

CSV and Excel Export (stories_export.csv/xlsx)

Export file containing the same information as the database, useful for: - Easy viewing of story metadata - Filtering and sorting - Data analysis - Sharing data with others

Media Storage 📁

Photos are saved as: {user_id}_{story_id}.jpg
Videos are saved with their original extension: {user_id}_{story_id}.{extension}
All media files are saved in the script's directory

Features in Detail 🔍

Continuous Monitoring

Customizable checking interval (default: 60 seconds)
Runs continuously until manually stopped
Maintains state between runs
Avoids duplicate downloads

Media Handling

Supports both photos and videos
Automatically detects media type
Preserves original quality
Generates unique filenames

Error Handling 🛠️

The script includes: - Automatic retry mechanism for failed downloads - Error logging for failed operations - Connection error handling - State preservation in case of interruption

Limitations ⚠️

Subject to Telegram's rate limits
Stories must be currently active (not expired)
Media download size limits apply as per Telegram's restrictions

Contributing 🤝

Contributions are welcome! Please feel free to submit a Pull Request.

License 📄

This project is licensed under the MIT License - see the LICENSE file for details.

Disclaimer ⚖️

Download Telegram-Story-Scraper

FreshRSS

Telegram-Scraper - A Powerful Python Script That Allows You To Scrape Messages And Media From Telegram Channels Using The Telethon Library

Features 🚀

Prerequisites 📋

Required Python packages

Getting Telegram API Credentials 🔑

Setup and Running 🔧

Initial Scraping Behavior 🕒

Usage 📝

Channel IDs 📢

Data Storage 💾

Database Structure

Media Storage 📁

Exported Data 📊

Features in Detail 🔍

Continuous Scraping

Media Handling

Error Handling 🛠️

Limitations ⚠️

Contributing 🤝

License 📄

Disclaimer ⚖️

Telegram-Story-Scraper - A Python Script That Allows You To Automatically Scrape And Download Stories From Your Telegram Friends

Important Note About Story Access ⚠️

Features 🚀

Prerequisites 📋

Required Python packages

Getting Telegram API Credentials 🔑

Setup and Running 🔧

How It Works 🔄

Data Storage 💾

Database Structure (stories.db)

CSV and Excel Export (stories_export.csv/xlsx)

Media Storage 📁

Features in Detail 🔍

Continuous Monitoring

Media Handling

Error Handling 🛠️

Limitations ⚠️

Contributing 🤝

License 📄

Disclaimer ⚖️