OS
OSRepos
HomeRepositoriesRSS

Repository History

Explore all analyzed open source repositories

Topic: Data Extraction
sitefetch: Efficiently Scrape Websites for AI Model Training and Analysis

sitefetch: Efficiently Scrape Websites for AI Model Training and Analysis

sitefetch is a powerful command-line utility designed to fetch and save entire websites as plain text files. This tool is particularly useful for preparing large datasets for AI model training, allowing easy consumption of web content. It offers flexible options for page matching and content selection, ensuring relevant data extraction.

Oct 12, 2025
View Details
Scrapling: An Undetectable, Powerful, and Adaptive Python Web Scraping Library

Scrapling: An Undetectable, Powerful, and Adaptive Python Web Scraping Library

Scrapling is a high-performance Python library designed for effortless web scraping. It stands out with its adaptive capabilities, automatically adjusting to website changes, and advanced stealth features to bypass anti-bot systems. This makes it a robust solution for modern web data extraction needs.

Oct 11, 2025
View Details
Page 1
OS
OSRepos

Analysis and discovery of open source repositories. Find interesting projects and follow their updates.

Navigation

HomeRepositoriesSitemapRSS Feed

Legal

Privacy PolicyCookie Policy

© 2025 OSRepos. Built with Nuxt 3 and lots of ❤️

This site uses cookies to improve your experience. By continuing to browse, you agree to our Cookie Policy.