Skip to content

oneKn8/QUASAR

Repository files navigation

QUASAR: QUantum Algorithm Search via Augmented Reasoning

An autonomous agent that discovers novel quantum circuits using 15TB of classical physics simulations to achieve 100x faster exploration than traditional methods.


What QUASAR Does

QUASAR takes a physics problem and discovers quantum circuits that solve it:

  1. Input: Physics goal (e.g., "Find ground state of 6-qubit MHD lattice")
  2. Propose: Physics-augmented LLM generates candidate circuits informed by conservation laws and symmetries
  3. Filter: Surrogate model scores 10,000+ candidates in milliseconds (vs hours with VQE)
  4. Evaluate: Top 10% undergo full VQE optimization
  5. Validate: Best circuits run on IBM quantum hardware
  6. Output: Novel circuits that outperform human-designed ansatzes

Architecture

THE WELL (15TB) ──► PHYSICS ENCODER ──┬──► SURROGATE EVALUATOR (10ms)
                                      │
                                      ├──► PHYSICS-AUGMENTED LLM TRAINING
                                      │
                                      └──► NEW HAMILTONIAN TARGETS (MHD)
                                              │
                                              ▼
                    ┌──────────────────────────────────────────┐
                    │            QUANTUMMIND AGENT             │
                    │                                          │
                    │  Physics Goal ──► LLM Proposer ──► Verifier
                    │                       │                  │
                    │                       │    ┌─────────────┘
                    │                       ▼    ▼
                    │                   SURROGATE FILTER (fast)
                    │                       │
                    │                       ▼ (top 10% only)
                    │                   VQE EXECUTOR (slow)
                    │                       │
                    │                       ▼
                    │                   MEMORY + LEARNING
                    └──────────────────────────────────────────┘

Core Components

Surrogate Evaluator

Predicts circuit quality without running VQE. Trained on physics dynamics from The Well, then fine-tuned on quantum circuit data.

Metric Value
Inference time 10ms per circuit
Accuracy R^2 > 0.7
Speedup 100x exploration capacity

Physics-Augmented LLM

Fine-tuned Qwen2.5-Coder-7B that understands physics, not just code patterns. Training data includes:

  • 50% QuantumLLMInstruct (code patterns)
  • 30% Physics-augmented examples from The Well
  • 20% Physics reasoning chains

Barren Plateau Detector

Screens circuits before VQE to avoid untrainable parameter landscapes. Catches >90% of barren plateaus using gradient variance analysis.

Hardware Validation

Discovered circuits run on IBM Quantum hardware (ibm_sherbrooke, 127 qubits) with ZNE error mitigation.


The Well Integration

QUASAR uses The Well dataset (15TB of physics simulations from Polymathic AI) for three purposes:

Purpose How
Surrogate pretraining Learn physics dynamics before quantum circuit data
LLM augmentation Generate physics-to-circuit training examples
New targets Map MHD/turbulence simulations to quantum Hamiltonians

Supported Datasets

Dataset Domain Use
shear_flow Fluid dynamics Initial proof of concept
MHD_64 Magnetohydrodynamics Novel discovery target
turbulence_gravity_cooling Astrophysics Many-body correlations
rayleigh_benard Convection Thermal physics

Discovery Targets

Toy Problems (Validation)

Hamiltonian Qubits Purpose
XY Chain 4, 6, 8 Validate against known solutions
Heisenberg 4, 6 Test physics reasoning
TFIM 4, 6, 8 Multiple field strengths

Novel Targets (Contribution)

Hamiltonian Source Why Novel
MHD_LATTICE The Well MHD_64 First quantum circuits for MHD

MHD circuits are validated against The Well ground truth, targeting >80% fidelity.


Directory Structure

quantum-mind/
├── src/
│   ├── quantum/           # Hamiltonians, BP detector, verifier, executor
│   ├── agent/             # Proposer, memory, analyzer, discovery loop
│   ├── training/          # Dataset utilities, fine-tuning
│   ├── evaluation/        # Baselines, metrics, statistical tests
│   └── quasar/            # Surrogate, Well loader, physics encoder
├── data/
│   ├── the_well/          # The Well datasets (HDF5)
│   ├── processed/         # Training data
│   └── surrogate/         # Surrogate training data
├── experiments/
│   ├── xy_chain/          # XY chain discovery campaigns
│   ├── heisenberg/        # Heisenberg campaigns
│   ├── tfim/              # TFIM campaigns
│   └── mhd/               # MHD discovery (novel)
├── models/
│   └── checkpoints/       # Fine-tuned model weights
├── guidelines/            # Build specifications (no code)
└── paper/                 # Research paper materials

Evaluation

Baselines

Baseline Description
HEA (2-layer) Hardware-efficient ansatz
HEA (3-layer) Deeper HEA
EfficientSU2 Qiskit optimized ansatz
Random Control baseline

Metrics

Metric Description
Energy error Absolute difference from exact ground state
Relative error Percentage error
Circuit depth Total layers
Two-qubit gates CX/CZ count (hardware cost)
Trainability Gradient variance (BP indicator)

Statistical Testing

All comparisons use paired t-tests or Wilcoxon signed-rank with 10+ trials per circuit. Effect sizes reported as Cohen's d.


Hardware Validation

Circuits run on IBM Quantum with:

Parameter Value
Backend ibm_sherbrooke (127 qubits)
Resilience level 2 (ZNE)
Shots 4000 per circuit
Runs 5x for statistics

Each hardware run records job ID, backend calibration date, transpiled depth, and raw/mitigated energies.


Key Claims

Claim Evidence
100x speedup Surrogate enables 10,000+ circuit exploration vs ~100 with VQE only
Physics-augmented improvement 20%+ lower energy error than code-only LLM
Novel MHD circuits First quantum circuits for magnetohydrodynamics
Hardware validated IBM Quantum results confirm simulation accuracy

Requirements

  • Python 3.10+
  • IBM Quantum account (free tier works)
  • GPU for training (A100 recommended)

Dependencies

qiskit
qiskit-ibm-runtime
qiskit-aer
torch
transformers
the-well
h5py
unsloth
peft
scipy

References

Papers

Datasets

Tools


Build Guidelines

Detailed specifications in guidelines/:

Phase Guideline Content
1 01_ENVIRONMENT.md IBM token, The Well setup
2 02_WELL.md Data loader, physics encoder
3 03_FINETUNING.md Physics-augmented LLM training
4 04_SURROGATE.md Surrogate model, discovery integration
5 05_EVALUATION.md Baselines, metrics, statistics
6 06_HARDWARE.md IBM runner, error mitigation
7 07_DISCOVERY.md Discovery campaigns (XY, Heisenberg, TFIM, MHD)
8 08_PAPER.md Results documentation, paper draft

The Contribution

QUASAR fills a gap: no one has used large-scale classical physics simulation data to improve quantum algorithm discovery.

Current State QUASAR Innovation
LLMs trained on code only LLMs trained on code + physics dynamics
VQE bottleneck (~100 circuits) Surrogate enables ~10,000 circuits
Toy problems only Real physics targets (MHD)
No physics grounding Physics-informed from 15TB simulations

About

Quantum circuit discovery agent. Physics-augmented LLM proposes candidates, surrogate model filters 10k+ in milliseconds, top 10% go through full VQE optimization.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors