Anywhere

Go Anywhere — A voice-controlled virtual explorer that transforms Google Street View into an interactive, AI-guided tour experience.

Overview

Anywhere is a next-generation virtual tour guide powered by Google's Gemini AI. Unlike traditional Street View exploration, Anywhere provides:

Voice Interaction: Talk naturally with an AI tour guide using Gemini Live API
Intelligent Navigation: AI-controlled camera movements via function calling (not browser automation)
Real-Time Knowledge: Live Google Search grounding for accurate, up-to-date information
AI Selfie Souvenirs: Generate composite images placing you in any location worldwide

Features

🎙️ Voice-Controlled Navigation

Speak naturally to navigate:

"Take me to the Colosseum"
"Turn around and show me what's behind us"
"Walk forward down this street"
"What's that building on the left?"

🗺️ Global Exploration

Access any location with Street View coverage:

Instant teleportation to landmarks worldwide
Smooth GSAP-animated camera movements
Real-time location awareness with reverse geocoding

🧠 AI Tour Guide

Get intelligent commentary powered by Gemini:

Historical facts and architectural details
Local recommendations and cultural insights
Context-aware descriptions of visible landmarks

📸 AI Selfie Generation

Create souvenir photos with Nano Banana Pro:

Upload your photo and place yourself in any scene
Multiple styles: Polaroid, Vintage, Professional, Fun
High-quality composites with matched lighting

Technology Stack

Category	Technology
Framework	Next.js 16.0.7 with React 19.2
AI	Gemini Live API, Gemini 3 Pro Image
Maps	Google Maps JavaScript API
Animation	GSAP 3.12
State	Zustand 5
UI	Shadcn UI + Tailwind CSS 4
Audio	Web Audio API

Getting Started

Prerequisites

Latest Node.js at time of last commit (24.x)
pnpm (recommended) or npm
Google Cloud Platform account
Google AI Studio account

API Keys Setup

Google Maps API Key

Go to Google Cloud Console
Create a new project or select an existing one
Navigate to APIs & Services > Library
Enable the following APIs:
- Maps JavaScript API
- Street View Static API
- Geocoding API
- Places API
Go to APIs & Services > Credentials
Click Create Credentials > API Key
(Recommended) Restrict the key to your domain and the required APIs

Gemini API Key

Go to Google AI Studio
Click Get API key or navigate to API keys
Create a new API key
Ensure access to:
- Gemini Live API (real-time audio streaming)
- Gemini 3 Pro Image (selfie generation)

Installation

# Clone the repository
git clone https://github.com/yourusername/anywhere.git
cd anywhere

# Install dependencies
pnpm install

# Create environment file
cp .env.example .env.local

# Add your API keys to .env.local
# NEXT_PUBLIC_GOOGLE_MAPS_API_KEY=your_maps_key_here
# NEXT_PUBLIC_GEMINI_API_KEY=your_gemini_key_here

# Start development server
pnpm dev

Open http://localhost:3000 to start exploring.

Usage

Connect: Click the green phone button to connect to the AI tour guide
Speak: Click the microphone (or press Space) and speak naturally
Navigate: Ask to visit places, turn around, or walk forward
Learn: Ask about landmarks, history, or local recommendations
Selfie: Click the camera button to create AI-generated souvenirs

Voice Commands Examples

Command	Action
"Take me to the Eiffel Tower"	Teleports to Paris
"Turn around"	Rotates view 180°
"Walk forward"	Advances along the street
"Look up at the ceiling"	Tilts camera upward
"What's the history of this place?"	Triggers Google Search for facts
"Take a selfie"	Opens selfie generation dialog

Project Structure

src/
├── app/
│   ├── api/selfie/          # Server-side selfie API
│   ├── layout.tsx           # Root layout with fonts
│   ├── page.tsx             # Main application page
│   └── globals.css          # Global styles + Tailwind
├── components/
│   ├── street-view/         # Street View panorama
│   ├── anywhere-explorer.tsx # Main orchestrator
│   ├── voice-control-panel.tsx
│   ├── location-overlay.tsx
│   ├── selfie-dialog.tsx
│   └── tour-history-sheet.tsx
├── lib/
│   ├── gemini-live-client.ts # Gemini Live API client
│   ├── audio-handler.ts     # Microphone & playback
│   ├── navigation-tools.ts  # AI function declarations
│   ├── selfie-generator.ts  # Image generation
│   ├── system-prompt.ts     # AI persona definition
│   └── maps-loader.ts       # Google Maps loader
└── stores/
    └── street-view-store.ts # Zustand state management

Architecture

Key Design Decision: Function Calling over Browser Automation

Street View renders as a WebGL canvas with no DOM buttons. Browser automation tools fail because there's nothing to click. Instead, Anywhere uses Agentic Function Calling:

AI receives voice input: "Turn around"
AI issues structured command: pan_camera({ heading: 180, pitch: 0 })
Frontend executes against Google Maps API with GSAP animation
Updated viewport context sent back to AI

Audio Pipeline

Microphone → Web Audio API → PCM 16-bit 16kHz → Gemini Live API
                                                      ↓
                                            PCM 24kHz Response
                                                      ↓
                      Speakers ← Web Audio API ← PCM to Float32

Environment Variables

# Required
NEXT_PUBLIC_GOOGLE_MAPS_API_KEY=your_google_maps_api_key
NEXT_PUBLIC_GEMINI_API_KEY=your_gemini_api_key

# Optional (for server-side selfie generation)
GEMINI_API_KEY=your_gemini_api_key

Deployment

Just use Vercel.

Known Limitations

Limitation	Mitigation
Street View coverage gaps	AI detects and suggests alternatives
Live API latency	Visual feedback during processing
Image generation quality	Multiple style options, regeneration
Browser audio compatibility	Tested on Chrome, Firefox, Safari

Acknowledgments

Google Gemini AI for powering the tour guide intelligence
Google Maps Platform for Street View
Shadcn UI for the beautiful component library
GSAP for smooth animations

Built for the Gemini 3 Hackathon — December 2025

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
.github/workflows		.github/workflows
public		public
src		src
.env.local.example		.env.local.example
.gitignore		.gitignore
.prettierignore		.prettierignore
.prettierrc		.prettierrc
ANYWHERE_TECHNICAL_SPECIFICATION.md		ANYWHERE_TECHNICAL_SPECIFICATION.md
LICENSE		LICENSE
README.md		README.md
components.json		components.json
eslint.config.mjs		eslint.config.mjs
next.config.ts		next.config.ts
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
postcss.config.mjs		postcss.config.mjs
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Anywhere

Overview

Features

🎙️ Voice-Controlled Navigation

🗺️ Global Exploration

🧠 AI Tour Guide

📸 AI Selfie Generation

Technology Stack

Getting Started

Prerequisites

API Keys Setup

Google Maps API Key

Gemini API Key

Installation

Usage

Voice Commands Examples

Project Structure

Architecture

Key Design Decision: Function Calling over Browser Automation

Audio Pipeline

Environment Variables

Deployment

Known Limitations

Acknowledgments

About

Uh oh!

Releases

Packages

Languages

License

Ari-S-123/anywhere

Folders and files

Latest commit

History

Repository files navigation

Anywhere

Overview

Features

🎙️ Voice-Controlled Navigation

🗺️ Global Exploration

🧠 AI Tour Guide

📸 AI Selfie Generation

Technology Stack

Getting Started

Prerequisites

API Keys Setup

Google Maps API Key

Gemini API Key

Installation

Usage

Voice Commands Examples

Project Structure

Architecture

Key Design Decision: Function Calling over Browser Automation

Audio Pipeline

Environment Variables

Deployment

Known Limitations

Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages