May 3, 2026

AGENTIC.LIVE — The broadcast booth in a browser tab

gemini ai agents cricket accessibility sign language multilingual real-time build with ai

AGENTIC.LIVE — The broadcast booth in a browser tab

Point your phone or laptop camera at any cricket screen — TV, projector, live stadium — and six Gemini agents become a full broadcast booth in real time.

Six Gemini agents working together:

  • VISION watches video frames and generates Hinglish commentary in your chosen persona's voice (Bhogle / Shastri / Sidhu).
  • STAT OCRs the on-screen scoreboard so the system can't hallucinate scores.
  • DIRECTOR gates every line — APPROVE / REWRITE / HOLD — preventing stale or off-key output.
  • CROWD reads viewer emoji reactions in 5-second windows and tells the Director when the room is on fire.
  • TRANSLATOR fans out captions to English, Hindi, Tamil and Bengali — each viewer phone receives their preferred language with sub-second latency.
  • ISL LISTENER independently listens to broadcast audio via the microphone, transcribes via Gemini, generates Indian Sign Language gloss, and feeds it into a 3D signing avatar.

Problems solved:

  1. India's 1.8 crore Deaf cricket fans get ISL on EVERY match — not just the 10 marquee broadcasts a year that have human interpreters.
  2. Multilingual viewers in the same watch-party room get captions in their language without separate streams.
  3. Anyone with a phone becomes their own broadcast booth — gully cricket, school finals, college tournaments, the IPL on your TV.

Built solo in 4 hours at the Build With AI :: Agentic Premier League hackathon, GDG Cloud New Delhi, 03 May 2026.

Tech stack:

AI / Google: Gemini 2.5 Flash via @google/genai SDK powering all six agents · Audio + Vision multimodal input (mic chunks + JPEG frames inlineData) · Gemini API Paid Tier (project-billed) · AntiGravity IDE (Google's agentic IDE).

Frontend: Next.js 16 (App Router, React 19, TypeScript strict) · Tailwind CSS v4 + shadcn/ui · Inter Tight + Geist Mono · Browser SpeechSynthesis API for persona-mapped TTS.

Realtime: PartyKit on Cloudflare Workers (sub-second pub/sub) · partysocket React hook for bidirectional commentary fanout + emoji reactions back to the caster.

Accessibility: Vaani 3D ISL signing avatar integrated via /embed postMessage protocol — Audio → Gemini transcribe → ISL gloss → postMessage → avatar signs.

Hosting: Vercel (Next.js production) · Cloudflare Workers (PartyKit room server).

Open-source code lifted: google-gemini/live-api-web-console (Apache-2.0) — audio worklets + GenAI client patterns.

Links:

1

Give a star to encourage!Discussion
Start a new conversation!
Login to join the discussion