The model
Gildea uses BAAI/bge-base-en-v1.5, a widely-used English-optimized embedding model from the Beijing Academy of AI:| Property | Value |
|---|---|
| Model | BAAI/bge-base-en-v1.5 |
| Dimensions | 768 |
| Output | Normalized (unit-length) vectors — use dot product for cosine similarity |
| Language | English |
/v1/signals/{id}?include=embeddings and /v1/embed) live in the same space and can be compared directly.
Two endpoints, one vector space
Per-unit embeddings
Add
?include=embeddings to a signal detail request to get embeddings for every text unit in the decomposition, alongside the text itself.Embed your text
POST arbitrary text (up to 2000 characters) to
/v1/embed to get a vector in the same space. Use for embedding user content against Gildea’s corpus.Primary use case: local similarity
The pairing of these two endpoints enables client applications to compute semantic similarity between user content and Gildea’s verified extractions, entirely client-side:- Ingest Gildea signals with embeddings into local storage. Each text unit comes with its vector.
- Embed user content (notes, queries, drafts) via
/v1/embedas it’s created. - Compute cosine similarity locally between user vectors and stored Gildea vectors to find related claims, sentences, or theses.
Model stability
Theembedding_model field is returned with every embedding so clients can detect changes. If Gildea upgrades the embedding model, the field will reflect the new identifier, and clients should re-embed any user content they’ve stored locally. Existing Gildea text unit embeddings will also be re-computed on the next pipeline run.
Notes
- Rate limits apply. Both endpoints share your tier’s monthly request budget. See Rate Limits.
- Vectors are normalized. Each vector has unit length, so cosine similarity equals the dot product.
- Input length cap.
POST /v1/embedaccepts up to 2000 characters per request. Truncate longer content client-side. - No aggregate embeddings. Gildea stores one embedding per text unit, not per signal. To get a signal-level representation, aggregate unit embeddings client-side (mean or max-pool) if needed.