Purpose
Extracts products from page content using Gemini and saves them to the database. This is the first step in the product pipeline.Architecture
Why Chunking?
Large websites can have 50K+ chars of content. Asking Gemini to process all of it in one call causes timeouts. By splitting into 15K char chunks:- Faster responses - Each chunk completes in ~10-30s
- Parallel processing - All chunks run simultaneously
- Better reliability - If one chunk fails, others still succeed