Automated Web Data Processing and Summarization

Open
Lux Development
Markham, Ontario, Canada
Employer
(2)
3
Project
Academic experience
60 hours per learner
Learner
Anywhere
Intermediate level

Project scope

Categories
Data analysis Information technology Software development Artificial intelligence Data science
Skills
web scraping data extraction data curation data processing python (programming language) data cleansing real world data management wordpress artificial intelligence
Details

The project aims to develop an automated system for web data extraction, cleansing, and summarization using Python. Lux Development seeks to streamline the process of gathering and processing web data to enhance efficiency and accuracy. The project involves creating two distinct automated processes. The first process will focus on web scraping to extract a list of items from specified websites, followed by data cleansing and curation into a structured CSV format. The second process will utilize AI to generate concise summaries of each website's content, subsequently formatting these summaries into a CSV file suitable for import into a WordPress directory. This project provides learners with the opportunity to apply their knowledge of Python programming, data processing, and AI to solve real-world data management challenges.

Deliverables

The project deliverables include two Python Notebooks, each implementing one of the automated processes. The first deliverable is a Python Notebook that scrapes websites, cleanses, and curates data into a specified CSV format. The second deliverable is a Python Notebook that uses AI to summarize website content and formats the summaries into a CSV file for WordPress import. Additionally, comprehensive documentation detailing the setup, execution, and functionality of each process will be provided.

Mentorship
Skills, knowledge and expertise

Sharing knowledge in specific technical skills, techniques, methodologies required for the project.

Hands-on support

Direct involvement in project tasks, offering guidance, and demonstrating techniques.

Tools and/or resources

Providing access to necessary tools, software, and resources required for project completion.

Regular meetings

Scheduled check-ins to discuss progress, address challenges, and provide feedback.

About the company

Company
Markham, Ontario, Canada
2 - 10 employees
Real estate

Lux Development focuses on investing in residential, multi-family, land and commercial assets which have growth potentials. We acquire, improve to add value and exit on assets with the goal of maximizing investor returns and minimizing risks, with sustainability in mind. Lux’s focal investment locations include the Greater Toronto Area (GTA) and the surrounding areas. Value adding Investment opportunities in GTA include multi-family, land development, infill development and conversion.