Commit Graph

7 Commits

Author SHA1 Message Date
Torantulino
e6b794186e attempts to improve webpage summarisation. 2023-03-30 12:45:36 +01:00
Torantulino
23f19a8611 limits the number of links that a webpage can return. 2023-03-30 12:45:15 +01:00
Torantulino
26036d79ec Starts counting summarised chunks from 1. 2023-03-30 10:14:26 +01:00
Torantulino
114fc32d5f Adds hyperlink extraction from webpage
+ accompanying command.
2023-03-30 10:10:52 +01:00
Torantulino
693d141c86 Removes scrape_main_content function. 2023-03-29 09:43:32 +01:00
Torantulino
6d796d222d Adds error check to text scraper. 2023-03-29 09:43:10 +01:00
Torantulino
fc6c7bd8c4 Tides up codebase.
Extracts python functions to relevant files.
2023-03-28 23:25:42 +01:00