← All work
HASAN
09 / 10·AI AGENTS·2024·Beta

Website to AI Assistant Pipeline with Auto-Updating Vector Knowledge

Crawl any website once. Get a smart AI assistant that stays current automatically.

Discuss similar work →
Category
AI Agents
Year
2024
Role
Full Stack AI Engineer
Duration
5 Days
Client
Mike Olaski - Slovakia
Status
Beta
01 THE CORE IDEA

The Core Idea

Any website has valuable knowledge locked in its pages. Product details, pricing, service info, FAQs. This pipeline unlocks all of it and turns it into a queryable AI assistant without any manual copy-pasting or document preparation.

PythonFirecrawlSupabaseFlaskOpenAIN8NRAG
02 CRAWLING AND DATA COLLECTION

Crawling and Data Collection

Firecrawl API scrapes the entire target website and returns the content in JSON, Markdown and HTML formats. Every page, product listing and content block gets captured in a clean structured form ready for processing.

03 HIGH SPEED VECTOR INGESTION

High Speed Vector Ingestion

A custom Flask API handles the embedding pipeline against Supabase pgvector. The architecture was optimized to process 50,000 rows of vector data in under 2 minutes, making it practical for large sites and product catalogs without long setup waits.

04 AI ASSISTANT

AI Assistant

The vectors power a RAG-based AI agent that answers questions about the site instantly. It returns prices, URLs, product details and images from the actual site content. For e-commerce or service businesses it acts as a knowledgeable support agent that knows the catalog better than most staff.

05 AUTO-UPDATING KNOWLEDGE BASE

Auto-Updating Knowledge Base

When the website publishes new content or changes existing pages, the system detects the update and re-embeds only the changed content. The assistant stays accurate without anyone running the pipeline manually.

06 RESULTS
50k
Vectors processed in under 2 mins
3
Output formats per page crawled
100%
Knowledge base auto-synced on update
0
Manual steps to keep it current
07 GET IN TOUCH

Interested in working together?

I'm available for new projects. Let's talk about what you're building.

More projects
WhatsApp Business Platform with Meta Cloud API
Web Apps
WhatsApp Business Platform with Meta Cloud API
A full WhatsApp Web style inbox for your business number, built from scratch.
View →
CRM Automation Dashboard for Marketing Agency
AI Agents
CRM Automation Dashboard for Marketing Agency
Full lead lifecycle automation with zero manual work needed.
View →
Production Web Apps Built with Claude, Cursor, Codex and Lovable
Web Apps
Production Web Apps Built with Claude, Cursor, Codex and Lovable
From idea to deployed product in days, not weeks, with AI as a build collaborator.
View →
HubSpot CRM Automation and Full Ecosystem Integration
CRM
HubSpot CRM Automation and Full Ecosystem Integration
HubSpot working like an AI agent, not just a CRM.
View →
Meta Ads Lead Automation with AI Enrichment and WhatsApp Alerts
AI Agents
Meta Ads Lead Automation with AI Enrichment and WhatsApp Alerts
From ad form submission to enriched lead profile in seconds, fully automated.
View →
AI Person and Company Research Agent with Vector Memory
AI Agents
AI Person and Company Research Agent with Vector Memory
Drop a name. Get a full intelligence profile and a custom AI agent, automatically.
View →
AI-Powered Lead Scraper from Reddit, Quora and LinkedIn
AI Agents
AI-Powered Lead Scraper from Reddit, Quora and LinkedIn
Finds your next customer by what they post, not what they fill out.
View →
AI Voice Receptionist for Dental Clinic
AI Agents
AI Voice Receptionist for Dental Clinic
Answers every patient call, books appointments and sends reminders, around the clock.
View →
Full Stack Vibe Coding with AI Tools and Third Party Integrations
Web Apps
Full Stack Vibe Coding with AI Tools and Third Party Integrations
Ship production-ready apps fast using modern AI-assisted development tools.
View →
H A S A N
H A S A N

Full-stack developer & AI agent engineer. Available for select engagements. Currently building voice-first agents for SaaS teams.

Sitemap
HomeWorkTimelineAboutSkillsContact
Connect
LinkedInGitHubFacebookUpwork
Direct
hasan@latticecode.proWhatsAppBook a call
© 2026 HASAN KHAN · ALL RIGHTS RESERVED
BUILT WITH ● IN KHULNA