TomWartm/ITJobsCH

Job Market Insights

This is a script that scrapes jobs.ch and itjobs.ch, based on keywords related to IT, to summarize, visualize, and filter interesting jobs.

Last Data Downloaded: 26 February, 2025

Plot of Publication Date Distribution

  • A line plot showing the daily count of job publications since 1 January 2025.

For more analysis, see analysis.md
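
The daily count behind a plot like this can be sketched in a few lines. This is only an illustration, not the repository's actual code: it assumes each job is a dict with a hypothetical "publication_date" field holding an ISO 8601 string, which this README does not confirm.

```python
from collections import Counter
from datetime import date, datetime


def daily_publication_counts(jobs, since=date(2025, 1, 1)):
    """Count jobs per publication day, ignoring jobs published before `since`."""
    counts = Counter()
    for job in jobs:
        # "publication_date" is a hypothetical field name (assumption).
        published = datetime.fromisoformat(job["publication_date"]).date()
        if published >= since:
            counts[published] += 1
    # Sort by day so the result can be passed straight to a line plot.
    return sorted(counts.items())


# Made-up sample records, only to show the shape of the input.
sample_jobs = [
    {"publication_date": "2025-01-03T08:15:00"},
    {"publication_date": "2025-01-03T17:40:00"},
    {"publication_date": "2025-02-26T09:00:00"},
    {"publication_date": "2024-12-30T12:00:00"},  # before the cutoff, so ignored
]
counts = daily_publication_counts(sample_jobs)
```

The result is a list of (day, count) pairs sorted by day, which any plotting library can draw as a line.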

Instructions

1. Create and Activate the Conda Environment

Create a Conda environment called itjobs from the environment.yml file and install the required dependencies.

conda env create -f environment.yml
conda activate itjobs

2. Run the Scraper

If you want to download the latest data from jobs.ch and/or itjobs.ch, run the scraper.

Alternatively, you can skip this step and use the already downloaded data in data/jobs.json

python src/scraper.py

3. Preprocess the Data

To clean the raw file jobs.json and extract the important information from it, run the preprocessing script, which creates the jobs_preprocessed.json file in the data folder.

Alternatively, you can skip this step and use the already preprocessed data in data/jobs_preprocessed.json

python src/preprocessing.py

Warning: if you change preprocessing.py, you should delete jobs_preprocessed.json and run the preprocessing script again. However, you will lose your ratings on the jobs.
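
One possible way to avoid losing ratings is to keep a copy of the old jobs_preprocessed.json and copy the ratings into the regenerated file. The sketch below is not part of the repository. It assumes each job is a dict with a unique "id" key and an optional "rating" key; both field names are hypothetical, since the file schema is not documented here.

```python
def carry_over_ratings(old_jobs, new_jobs):
    """Return copies of new_jobs with ratings taken from matching old_jobs."""
    # "id" and "rating" are hypothetical field names (assumption).
    ratings = {
        job["id"]: job["rating"]
        for job in old_jobs
        if job.get("rating") is not None
    }
    merged = []
    for job in new_jobs:
        job = dict(job)  # copy so the input list is not mutated
        if job["id"] in ratings:
            job["rating"] = ratings[job["id"]]
        merged.append(job)
    return merged


# Made-up records: "a1" was rated before, "b2" was not, "c3" is new.
old = [{"id": "a1", "title": "Dev", "rating": 7}, {"id": "b2", "title": "Ops"}]
new = [
    {"id": "a1", "title": "Developer"},
    {"id": "b2", "title": "Operations"},
    {"id": "c3", "title": "Data Analyst"},
]
merged = carry_over_ratings(old, new)
```

Only jobs that already had a rating get one in the merged list, so unrated and new jobs would still show up for review.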

4. Review Jobs

You can review the downloaded jobs by running the review script. It shows each downloaded job that has not been rated yet, and you can rate the job from 0 to 9.

python src/review.py
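
The review step described above can be pictured as a small loop over the unrated jobs. This is a minimal sketch and not the code in src/review.py: the "title" and "rating" field names and the function names are hypothetical. The prompt function is passed in as a parameter so the example can run with scripted answers instead of keyboard input.

```python
def parse_rating(text):
    """Return an int from 0 to 9, or None if the text is not a valid rating."""
    text = text.strip()
    if len(text) == 1 and text in "0123456789":
        return int(text)
    return None


def unrated(jobs):
    """Return the jobs that have no rating yet."""
    # "rating" is a hypothetical field name (assumption).
    return [job for job in jobs if job.get("rating") is None]


def review(jobs, ask=input):
    """Ask for a rating for every unrated job, repeating until it is valid."""
    for job in unrated(jobs):
        rating = None
        while rating is None:
            rating = parse_rating(ask(f"{job['title']} - rating (0-9): "))
        job["rating"] = rating
    return jobs


# Scripted answers: "x" and "12" are rejected, "8" is accepted.
answers = iter(["x", "12", "8"])
jobs = [{"title": "Data Engineer", "rating": 3}, {"title": "Backend Developer"}]
review(jobs, ask=lambda prompt: next(answers))
```

Jobs that already carry a rating are skipped, which matches the behavior described above of only showing jobs that are not rated so far.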
