Skip to content

SemonMamdouh/scraping_sigma

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

4 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿ•ธ๏ธ Sigma Website Web Scraper (Efficient Python Notebook)

This project is a highly efficient web scraper built using Python in a Jupyter Notebook. It extracts product data from the Sigma website with optimal performance, clean code structure, and minimal resource usage โ€” making it a solid choice for scalable and customizable scraping tasks.

๐Ÿš€ Why This Notebook Is Efficient

โœ… Fast Page Parsing โ€“ Uses requests for HTTP calls and BeautifulSoup for lightweight HTML parsing.
โœ… Optimized Looping โ€“ Efficiently loops through pages and products without redundancy.
โœ… Clean Data Handling โ€“ Stores scraped data in a structured pandas DataFrame.
โœ… Easy to Modify โ€“ Well-commented, modular code allows easy customization (e.g., for different product categories).
โœ… Export Ready โ€“ Outputs data directly to a CSV file for further analysis or integration.

๐Ÿ“ฆ Extracted Data

The notebook extracts the following for each product:

  • Product Name
  • Product Link
  • Product Description

๐Ÿงฐ Tools & Libraries Used

  • requests โ€“ for sending HTTP requests
  • BeautifulSoup โ€“ for HTML parsing
  • pandas โ€“ for tabular data processing
  • csv โ€“ for exporting the results

๐Ÿ“‚ How It Works

  1. Send HTTP Request to the target Sigma product category page.
  2. Parse HTML Content using BeautifulSoup.
  3. Loop Through Product Listings to extract name, link, and description.
  4. Store Data in a pandas DataFrame.
  5. Export Results to a CSV file.

๐Ÿ› ๏ธ How to Run

  1. Open the notebook: scraping_sigma_website.ipynb
  2. Run all cells in order.
  3. A CSV file named sigma_products.csv will be generated with the results.

๐Ÿ“Œ Output Sample

Hereโ€™s a sample of the actual output:

Product Name Product Link Product Description
Aldrichยฎ Chemistry https://www.sigmaaldrich.com/US/en/products/aldrich Chemistry Products
Supelcoยฎ Analytical https://www.sigmaaldrich.com/US/en/products/supelco Analytical Products

โœ… The output is clean, structured, and ready for use in analysis or applications.

โš ๏ธ Legal Note

This scraper is for educational purposes only. Please ensure you are authorized to scrape content from the Sigma website and always respect their robots.txt and terms of use.

๐Ÿ“„ License

Open for academic and non-commercial use under the MIT License.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published