Skip to content

Added Scrape Tools + Deadlines Data Class

Vedant Gupta requested to merge feature/web-scraper into main

Scrape tools is a new file which holds scraper functions. Currently it only has one function, Scrape

Scrape(website_url, date_index, assignment_index)
website_url - the scrape target
date_index - the index (starting from 0) of the table column with the due date
assignment_index - the index (starting from 0) of the table column with the assignmed work

Output is a Deadlines object.

Deadlines data class contains MutableList<Deadline> and count. Each Deadline object holds the date and assigned work. Count is the size of the MutableList.

Current Scrape assumptions:

  • Table has 1 header row
  • Website uses table tags
  • There is 1 row for deadline and 1 row for assignments (our class website doesnt do this 😞 )

Main app file contains example of scraping our class website (Dates column and Submissions column)

Edited by Vedant Gupta

Merge request reports