
This utility scrapes Canada's Express Entry website and returns all the draws. It was built to support an educated forecast of Express Entry draws.


rubencosta13/Express-Entry-Scrapper


Express Entry CRS Scrapper


About This Project

In short:
I was looking for a way to extract data from one particular page, Canada's 🍁 Express Entry draws, so that I could build a simple forecast system. The draws are already at #215 (at the time of writing this README), which is too much data to copy and paste by hand in a short time. So I came up with this solution: a simple web scraper that can:

  • Take screenshots of the pages
  • Export data to a .txt file, as well as to a CSV or Excel file
  • Save time when retrieving data from the website
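
As a rough illustration of the CSV export idea, the sketch below turns a list of draw records into CSV text using plain Node.js. The record shape (drawNumber, date, program, crsScore), the helper names escapeCsvField and drawsToCsv, and the sample values are all assumptions made for this example; they are not taken from the project's source and the values are not real draw results.

```javascript
// Hypothetical record shape: the real scraper's field names are not shown in this README.

// Quote a field if it contains a comma, quote, or line break; double any embedded quotes.
function escapeCsvField(value) {
  const text = String(value);
  return /[",\n\r]/.test(text) ? `"${text.replace(/"/g, '""')}"` : text;
}

// Build CSV text with a header row followed by one row per draw.
function drawsToCsv(draws, columns) {
  const header = columns.map(escapeCsvField).join(',');
  const rows = draws.map((draw) =>
    columns.map((col) => escapeCsvField(draw[col] ?? '')).join(',')
  );
  return [header, ...rows].join('\n');
}

// Placeholder values for illustration only, not real draw results.
const sampleDraws = [
  { drawNumber: 1, date: '2000-01-01', program: 'Example, program', crsScore: 400 },
  { drawNumber: 2, date: '2000-01-15', program: 'Example', crsScore: 410 },
];

const csv = drawsToCsv(sampleDraws, ['drawNumber', 'date', 'program', 'crsScore']);
console.log(csv);
```

The resulting string could then be saved with Node's fs.writeFileSync. Quoting fields that contain commas keeps each record on a single, parseable row.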

Project Status

Todo:

Add Docker support and start the Chromium window in headless mode

How to Set Up this utility

  1. Open settings.json. The file should contain the following:
{
    "add_screenShots": false,
    "export_to_excel": false
}
"add_screenShots" -> Controls whether page screenshots are taken
"export_to_excel" -> Controls whether data is written to an Excel spreadsheet
  2. Enable or disable the options that are useful for your use case
  3. Navigate to the project directory and run:
$ cd scrapper # Navigate to project's directory
$ npm install # This installs all project's dependencies
$ npm start   # This starts the scrapper
Running npm start begins scraping the webpage. A Chromium window should pop up; in future versions the process will be fully headless.
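
For readers curious how the two flags in settings.json might be consumed, here is a minimal sketch that reads the file and falls back to false for any missing flag. The helper names parseSettings and loadSettings are hypothetical and the validation rules are an assumption; the project's actual loading code may differ.

```javascript
const fs = require('fs');

// Defaults mirror the two flags documented above.
const DEFAULT_SETTINGS = { add_screenShots: false, export_to_excel: false };

// Hypothetical helper, not taken from the project's source.
// Parses JSON text and returns the known flags, using defaults for missing ones.
function parseSettings(jsonText) {
  const parsed = JSON.parse(jsonText);
  if (parsed === null || typeof parsed !== 'object') {
    throw new TypeError('settings must be a JSON object');
  }
  const settings = { ...DEFAULT_SETTINGS };
  for (const key of Object.keys(DEFAULT_SETTINGS)) {
    if (key in parsed) {
      if (typeof parsed[key] !== 'boolean') {
        throw new TypeError(`"${key}" must be true or false`);
      }
      settings[key] = parsed[key];
    }
  }
  return settings;
}

// Reads the settings file from disk, e.g. loadSettings('settings.json').
function loadSettings(filePath) {
  return parseSettings(fs.readFileSync(filePath, 'utf8'));
}

// Only one flag is given here, so the other keeps its default.
const example = parseSettings('{ "add_screenShots": true }');
console.log(example);
```

Rejecting non-boolean values early means a typo such as "yes" is reported when the file is read instead of silently disabling an export.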

Technologies Used

Help

If you need any help, send me an email with the subject "Express Entry Scrapper".

Made with <3 by Ruben Costa
