Project details:
Project created while working for Udooku LLC.
Planned the requirements for a web app that scrapes links, images, and several video formats (HTML video, Vimeo, YouTube, DaCast, etc.) from websites. After scraping the assets, the web app returns a report on each asset’s status so that the end user can act on it. The main feature of the project is a set of cron jobs that run every hour for each site and use Java multithreading to balance crawling speed against data comparison.
The backend was developed with Java and Spring Boot, and the frontend with Next.js, React, TypeScript, and Tailwind CSS.
Tech stack: Java, Spring Boot, PostgreSQL, Next.js, TypeScript, React 18, Tailwind CSS
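
As a rough illustration of the hourly per-site jobs described above, the sketch below shows one way such a job could be structured with the standard java.util.concurrent classes. It is not the project's actual code: the class name SiteCrawlScheduler, the AssetStatus values, the checker function, and the pool size are all assumptions made for the example, and a Spring Boot application could equally use its own scheduling support instead of a ScheduledExecutorService.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.Callable;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.function.Consumer;
import java.util.function.Function;
import java.util.function.Supplier;

// Hypothetical sketch: one hourly job per site, with a bounded worker pool
// that checks the site's assets in parallel and collects a status report.
public class SiteCrawlScheduler {

    // Assumed status values for the per-asset report.
    public enum AssetStatus { OK, BROKEN }

    private final ScheduledExecutorService scheduler =
            Executors.newSingleThreadScheduledExecutor();
    private final ExecutorService workers;
    private final Function<String, AssetStatus> checker;

    // workerThreads bounds the parallelism; checker decides the status of one asset URL.
    public SiteCrawlScheduler(int workerThreads, Function<String, AssetStatus> checker) {
        this.workers = Executors.newFixedThreadPool(workerThreads);
        this.checker = checker;
    }

    // Checks every asset URL on the worker pool and returns a URL -> status report.
    public Map<String, AssetStatus> crawlOnce(List<String> assetUrls) {
        Map<String, AssetStatus> report = new ConcurrentHashMap<>();
        List<Callable<Void>> tasks = new ArrayList<>();
        for (String url : assetUrls) {
            tasks.add(() -> {
                report.put(url, checker.apply(url));
                return null;
            });
        }
        try {
            // Blocks until every task has finished (or failed).
            workers.invokeAll(tasks);
        } catch (InterruptedException e) {
            // Restore the interrupt flag and return whatever was collected so far.
            Thread.currentThread().interrupt();
        }
        return report;
    }

    // Runs a crawl for one site immediately and then once per hour.
    public void scheduleHourly(Supplier<List<String>> assetSource,
                               Consumer<Map<String, AssetStatus>> reportSink) {
        scheduler.scheduleAtFixedRate(() -> {
            try {
                reportSink.accept(crawlOnce(assetSource.get()));
            } catch (RuntimeException e) {
                // An uncaught exception would cancel all future runs of this job.
                System.err.println("Crawl failed: " + e.getMessage());
            }
        }, 0, 1, TimeUnit.HOURS);
    }

    public void shutdown() {
        scheduler.shutdownNow();
        workers.shutdownNow();
    }
}
```

Capping the worker pool is one simple way to trade crawl speed against load on the crawled site and on the comparison step; the right pool size would depend on the real workload.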