Web Scrapers

A collection of C# web scrapers targeting Bulgarian websites — extracting recipes, news, jokes, and geographic data.


Four purpose-built scrapers for four different Bulgarian sites — turning HTML into structured, usable data.



📖 About

This repository is a collection of focused C# web scrapers, each targeting a specific Bulgarian website to extract structured data for use in other projects. The scrapers use AngleSharp for HTML parsing — the same library used in AYN and novini4ka — to traverse the DOM and extract exactly the data needed.

Each scraper is self-contained, simple to run, and outputs clean, normalised data ready for database seeding or further processing.
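As a rough illustration of the DOM traversal described above, here is a minimal sketch using AngleSharp's `BrowsingContext` and CSS selectors. The sample markup, the selectors (`article.item`, `h2`, `p.body`), and the class name `ParsingSketch` are hypothetical and are not taken from any of the scrapers in this repository; real pages need selectors matched to their actual structure.

```csharp
using System;
using System.Linq;
using System.Threading.Tasks;
using AngleSharp;

public static class ParsingSketch
{
    // Hypothetical markup; real pages on the target sites will differ.
    private const string SampleHtml = @"
        <html><body>
          <article class=""item""><h2>First title</h2><p class=""body"">First body</p></article>
          <article class=""item""><h2>Second title</h2><p class=""body"">Second body</p></article>
        </body></html>";

    public static async Task Main()
    {
        // A default configuration is enough when the HTML is supplied directly.
        var context = BrowsingContext.New(Configuration.Default);
        var document = await context.OpenAsync(request => request.Content(SampleHtml));

        // Select each item, then pull the fields of interest out of it.
        var items = document.QuerySelectorAll("article.item")
            .Select(article => new
            {
                Title = article.QuerySelector("h2")?.TextContent.Trim(),
                Body = article.QuerySelector("p.body")?.TextContent.Trim(),
            });

        foreach (var item in items)
        {
            Console.WriteLine($"{item.Title}: {item.Body}");
        }
    }
}
```

The null-conditional operator guards against elements that are missing from a page, which is common when a site changes its layout.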


🗂️ Scrapers

🍳 01 — Gotvach.bg Scraper

Scrapes gotvach.bg — one of Bulgaria's most popular cooking and recipe portals.

Extracts recipe data including titles, ingredients, preparation steps, categories, and images — useful for seeding recipe databases or building food-related applications.

📁 Browse →


📰 02 — BtvNovinite Scraper

Scrapes btvnovinite.bg — the online news portal of bTV, Bulgaria's largest private television network.

Extracts news articles, headlines, publication dates, categories, and article content — used to seed the novini4ka news aggregation platform.

📁 Browse →


😂 03 — Vicove.com Scraper

Scrapes vicove.com — a popular Bulgarian jokes and humour website.

Extracts jokes by category — the data feeds the ViceIO entertainment platform and the PrograMEMEin' meme automation app.

📁 Browse →


🗺️ 04 — Bulgarian Neighbourhoods Scraper

Scrapes geographic reference data — Bulgarian neighbourhoods, quarters, and settlements.

Extracts structured location data used for address validation, delivery integrations, and geographic filtering in applications like AYN and the EcontAPI Wrapper.

📁 Browse →


🛠️ Tech Stack

| Component    | Technology                 |
|--------------|----------------------------|
| Language     | C# (.NET)                  |
| HTML Parsing | AngleSharp                 |
| HTTP         | System.Net.Http.HttpClient |
| IDE          | Visual Studio              |

🚀 Getting Started

  1. Clone the repository

    git clone https://github.com/georgidelchev/Web-Scrapers.git
    cd Web-Scrapers
  2. Open any scraper's .sln file in Visual Studio or Rider.

  3. Configure the output. Depending on the scraper, data is written to the console, to JSON, or to a database. Check that scraper's Program.cs for the available configuration options.

  4. Run the scraper from its project directory (the folder containing the .csproj file)

    dotnet run
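
For the JSON output mentioned in step 3, one possible approach is sketched below with `System.Text.Json`. The `ScrapedItem` record, the sample values, and the `output.json` path are hypothetical placeholders; the scrapers in this repository may serialise their data differently. The sketch assumes .NET 5 or later (for `record` types).

```csharp
using System.Collections.Generic;
using System.IO;
using System.Text.Encodings.Web;
using System.Text.Json;
using System.Text.Unicode;
using System.Threading.Tasks;

// Hypothetical record shape, for illustration only.
public record ScrapedItem(string Title, string Category);

public static class JsonOutputSketch
{
    public static async Task Main()
    {
        // Placeholder Bulgarian values ("Sample title", "Sample category").
        var items = new List<ScrapedItem>
        {
            new("Примерно заглавие", "Примерна категория"),
        };

        var options = new JsonSerializerOptions
        {
            WriteIndented = true,
            // The default encoder escapes Cyrillic as \uXXXX sequences;
            // allowing all Unicode ranges keeps the text readable in the file.
            Encoder = JavaScriptEncoder.Create(UnicodeRanges.All),
        };

        // "output.json" is a placeholder path.
        await File.WriteAllTextAsync("output.json", JsonSerializer.Serialize(items, options));
    }
}
```

The encoder setting matters for Bulgarian content in particular: without it the JSON is still valid, but every Cyrillic character is written as an escape sequence.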

Note: Web scraping should always be done responsibly. Add delays between requests, respect robots.txt, and do not overload target servers.
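
A minimal sketch of the "add delays between requests" advice, using `HttpClient` and `Task.Delay`. The two-second delay, the `ExampleScraper/1.0` user agent, the `example.com` URLs, and the `PoliteFetcher` name are all assumptions chosen for illustration. The sketch does not check robots.txt, so that still has to be done separately.

```csharp
using System;
using System.Collections.Generic;
using System.Net.Http;
using System.Threading.Tasks;

public static class PoliteFetcher
{
    // Hypothetical delay; tune it to what the target site can reasonably tolerate.
    private static readonly TimeSpan DelayBetweenRequests = TimeSpan.FromSeconds(2);

    public static async Task<List<string>> FetchAllAsync(HttpClient client, IEnumerable<string> urls)
    {
        var pages = new List<string>();
        foreach (var url in urls)
        {
            // Requests are issued one at a time, never in parallel.
            pages.Add(await client.GetStringAsync(url));

            // Pause before the next request so the server is not flooded.
            await Task.Delay(DelayBetweenRequests);
        }

        return pages;
    }

    public static async Task Main()
    {
        using var client = new HttpClient();

        // Identify the scraper to site owners (placeholder value).
        client.DefaultRequestHeaders.UserAgent.ParseAdd("ExampleScraper/1.0");

        // Placeholder URLs, not real scraper targets.
        var urls = new[] { "https://example.com/page/1", "https://example.com/page/2" };
        var pages = await FetchAllAsync(client, urls);
        Console.WriteLine($"Fetched {pages.Count} pages.");
    }
}
```

Reusing a single `HttpClient` for the whole run avoids opening a new connection pool for every request.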


🔗 Used In

These scrapers were built to feed data into other projects in this portfolio:

| Scraper                  | Used In                                  |
|--------------------------|------------------------------------------|
| BtvNovinite              | novini4ka (news aggregation platform)    |
| Vicove.com               | ViceIO (entertainment platform)          |
| Vicove.com               | PrograMEMEin' (Instagram meme bot)       |
| Bulgarian Neighbourhoods | AYN (marketplace platform)               |
| Bulgarian Neighbourhoods | EcontAPI Wrapper (delivery integration)  |

👤 Author

Georgi Delchev

GitHub LinkedIn Facebook


📄 License

This project is licensed under the MIT License.


Made with ☕ and AngleSharp · scrape responsibly
