I need Scrapy code that extracts the required information from the attached file. No web scraping, logins, etc. are needed. The required information is in the nested lists within each list of class "atl-list". The data should be output as tab-separated values (TSV).
The required fields are:
- class - the text from the previous <h3> tag
- cluster - the text in the <strong> tag
- skills - the text in the level 2 list
- area - the text in the level 3 list
- indicator - the text in the level 4 list
So for example, the first two lines would be as follows.
Sciences (Year 7) \t Communication \t I. Communication skills \t Exchanging thoughts, messages and information effectively through interaction \t Collaborate with peers and experts using a variety of digital environments and media
Sciences (Year 7) \t Communication \t I. Communication skills \t Reading, writing and using language to gather and communicate information \t Make inferences and draw conclusions
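To make the expected traversal concrete, here is a minimal sketch of the extraction logic. It uses Python's standard-library html.parser rather than Scrapy selectors, so that it runs without any installation; the same depth-tracking idea could be ported to a Scrapy spider. The attached file is not reproduced here, so SAMPLE_HTML is an assumed structure: each <h3> is followed by a <ul class="atl-list">, the <strong> sits in the level 1 items, and every <li> is explicitly closed. The names AtlListParser, extract_rows and to_tsv are illustrative only.

```python
from html.parser import HTMLParser

# Hypothetical markup: the real attachment is not shown, so this layout is assumed.
SAMPLE_HTML = """
<h3>Sciences (Year 7)</h3>
<ul class="atl-list">
  <li><strong>Communication</strong>
    <ul>
      <li>I. Communication skills
        <ul>
          <li>Exchanging thoughts, messages and information effectively through interaction
            <ul><li>Collaborate with peers and experts using a variety of digital environments and media</li></ul>
          </li>
          <li>Reading, writing and using language to gather and communicate information
            <ul><li>Make inferences and draw conclusions</li></ul>
          </li>
        </ul>
      </li>
    </ul>
  </li>
</ul>
"""


def _clean(chunks):
    """Join text chunks and collapse runs of whitespace into single spaces."""
    return " ".join("".join(chunks).split())


class AtlListParser(HTMLParser):
    """Collects (class, cluster, skills, area, indicator) rows from nested lists."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self.depth = 0                      # 0 = outside an atl-list, 1..4 = list level
        self.in_h3 = False
        self.in_strong = False
        self.h3 = []                        # text of the most recent <h3>
        self.strong = []                    # text of the current level 1 <strong>
        self.text = {2: [], 3: [], 4: []}   # current item text per list level

    def handle_starttag(self, tag, attrs):
        if tag == "h3":
            self.in_h3, self.h3 = True, []
        elif tag in ("ul", "ol"):
            classes = (dict(attrs).get("class") or "").split()
            if self.depth > 0:
                self.depth += 1             # a list nested inside an atl-list
            elif "atl-list" in classes:
                self.depth = 1              # entering a top-level atl-list
        elif tag == "strong" and self.depth == 1:
            self.in_strong, self.strong = True, []
        elif tag == "li" and self.depth in self.text:
            self.text[self.depth] = []      # a new item starts at this level

    def handle_endtag(self, tag):
        if tag == "h3":
            self.in_h3 = False
        elif tag == "strong":
            self.in_strong = False
        elif tag in ("ul", "ol") and self.depth > 0:
            self.depth -= 1
        elif tag == "li" and self.depth == 4:
            # Assumes </li> is written explicitly; one row per level 4 item.
            self.rows.append((_clean(self.h3), _clean(self.strong),
                              _clean(self.text[2]), _clean(self.text[3]),
                              _clean(self.text[4])))

    def handle_data(self, data):
        if self.in_h3:
            self.h3.append(data)
        elif self.in_strong:
            self.strong.append(data)
        elif self.depth in self.text:
            self.text[self.depth].append(data)


def extract_rows(html):
    """Return a list of 5-tuples, one per level 4 list item."""
    parser = AtlListParser()
    parser.feed(html)
    parser.close()
    return parser.rows


def to_tsv(rows):
    """Render rows as tab-separated lines."""
    return "\n".join("\t".join(row) for row in rows)
```

To try it on the real file, read the file's contents into a string and pass it to extract_rows, then write the result of to_tsv to the output file. If the actual markup differs from the assumed layout (for example, unclosed <li> tags or the <strong> at a different level), the depth checks would need adjusting.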
"Hi,
Checked your HTML. Can create the scraper easily.
I am a scraping expert. Can scrape any website.
Scraped over 1021+ Websites/Sources. Skills: PHP, Python, Node
Please assign me this project.
Thanks!"
Hello!
I am experienced in using Scrapy with Python and can do the work within 2 days. I will store the data in the desired format: Excel, CSV, JSON or text file.
Message me, so that we can discuss further and can start working.