This programmme scrap the GSoC 2021 website (https://summerofcode.withgoogle.com/projects/) and stores the data in a csv file in the following format: Name,Organization,Project It uses 'requests', 'bs4',and 'selenium' libraries in python.
- The programme is written for system having 'Chrome' installed and also, 'chromedriver' must be present on the directory in which the programme is run.
- On line 13 of the python code, 'SCROLL_PAUSE_TIME' is the time difference (in seconds) between consecutive scrolls. Although, the default value has been put after testing few cases, it should be modified by user depending upon internet speed. A slower internet connection user may increase the 'SCROLL_PAUSE_TIME'.
- Then, one may procced to run the code (python3 gsoc2021.py) in the terminal. It may take about 15 minutes to execute.
- The data will be written in a csv file named 'gsoc2021.csv' in the same directory where the programme is located and run.
- The data will be written into the csv file in the following format Name,Organisation,Project