Automated Web Data Processing and Summarization
Project scope
Categories
Data analysis Information technology Software development Artificial intelligence Data scienceSkills
web scraping data extraction data curation data processing python (programming language) data cleansing real world data management wordpress artificial intelligenceThe project aims to develop an automated system for web data extraction, cleansing, and summarization using Python. Lux Development seeks to streamline the process of gathering and processing web data to enhance efficiency and accuracy. The project involves creating two distinct automated processes. The first process will focus on web scraping to extract a list of items from specified websites, followed by data cleansing and curation into a structured CSV format. The second process will utilize AI to generate concise summaries of each website's content, subsequently formatting these summaries into a CSV file suitable for import into a WordPress directory. This project provides learners with the opportunity to apply their knowledge of Python programming, data processing, and AI to solve real-world data management challenges.
The project deliverables include two Python Notebooks, each implementing one of the automated processes. The first deliverable is a Python Notebook that scrapes websites, cleanses, and curates data into a specified CSV format. The second deliverable is a Python Notebook that uses AI to summarize website content and formats the summaries into a CSV file for WordPress import. Additionally, comprehensive documentation detailing the setup, execution, and functionality of each process will be provided.
Sharing knowledge in specific technical skills, techniques, methodologies required for the project.
Direct involvement in project tasks, offering guidance, and demonstrating techniques.
Providing access to necessary tools, software, and resources required for project completion.
Scheduled check-ins to discuss progress, address challenges, and provide feedback.
About the company
Lux Development focuses on investing in residential, multi-family, land and commercial assets which have growth potentials. We acquire, improve to add value and exit on assets with the goal of maximizing investor returns and minimizing risks, with sustainability in mind. Lux’s focal investment locations include the Greater Toronto Area (GTA) and the surrounding areas. Value adding Investment opportunities in GTA include multi-family, land development, infill development and conversion.