🧠 Brainy API Reference
Complete API documentation for Brainy Zero Configuration • Triple Intelligence • Database as a Value • Atomic Transactions • Time Travel
Updated: 2026-06-11 All APIs verified against actual code
Quick Start
import { Brainy, NounType, VerbType } from '@soulcraft/brainy'
const brain = new Brainy() // Zero config!
await brain.init() // VFS auto-initialized!
// Add data (text auto-embeds!)
const id = await brain.add({
data: 'The future of AI is here',
type: NounType.Concept,
metadata: { category: 'technology' }
})
// Search with Triple Intelligence
const results = await brain.find({
query: 'artificial intelligence',
where: { year: { greaterThan: 2020 } },
connected: { from: id, depth: 2 }
})
// Pin the current state as an immutable value
const db = brain.now()
// Commit an atomic multi-write batch (all-or-nothing)
await brain.transact([
{ op: 'update', id, metadata: { category: 'AI' } }
], { meta: { author: 'docs-example' } })
await db.get(id) // still sees the pre-transaction state — snapshot isolation
await db.release()
// Time travel: query any past state
const yesterday = await brain.asOf(new Date(Date.now() - 86_400_000))Core Concepts
🧬 Entities (Nouns)
Semantic vectors with metadata and relationships - the fundamental data unit in Brainy.
🔗 Relationships (Verbs)
Typed connections between entities with optional data and metadata - building knowledge graphs.
📊 Data vs Metadata
data: Content embedded into vectors. Searchable via semantic similarity (vector index) and hybrid text+semantic search. NOT queryable viawherefilters.metadata: Structured fields indexed by MetadataIndex. Queryable viawherefilters infind().
See Data Model for the full explanation.
🧠 Triple Intelligence
Vector search + Graph traversal + Metadata filtering in one unified query.
🧊 Database Values (Db)
The whole store pinned as an immutable value — snapshot isolation, atomic transact() batches, time travel with asOf(), instant hard-link snapshots with persist(). See the consistency model.
Table of Contents
- Core CRUD Operations
- Search & Query
- Aggregation Engine
- Relationships
- Batch Operations
- Database Values & Time Travel (Db API)
- Virtual Filesystem (VFS)
- Neural API
- Import & Export
- Configuration
- Storage Adapters
- Utility Methods
- Embedding & Analysis APIs
- Type System Reference
Core CRUD Operations
`add(params)` → `Promise`
Add a single entity to the database.
const id = await brain.add({
data: 'JavaScript is a programming language', // Text or pre-computed vector
type: NounType.Concept, // Required: Entity type
subtype: 'language', // Optional: sub-classification
metadata: { // Optional: queryable fields
category: 'programming',
year: 1995
}
})Parameters:
data:string | number[]- Content to embed (text auto-embeds) or pre-computed vectortype:NounType- Entity type (required)subtype?:string- Per-product sub-classification within the NounType (top-level standard field, indexed on the fast path). See Subtypes & Facets.metadata?:object- Structured queryable fields (indexed by MetadataIndex, used inwherefilters)id?:string- Custom ID (auto-generated UUID if not provided)vector?:number[]- Pre-computed vector (skips auto-embedding)confidence?:number- Type classification confidence (0-1)weight?:number- Entity importance/salience (0-1)ifAbsent?:boolean- By-ID idempotent insert. WhentrueAND a customidis supplied AND an entity with thatidalready exists, returns the existingidwithout writing (no throw, no overwrite). Ignored withoutid. See guides/optimistic-concurrency.
datais embedded into vectors for semantic search.metadatais indexed forwherefilters. See Data Model.
Strict-mode tip: if a vocabulary is registered for your
type(viabrain.requireSubtype()or by an SDK that wraps Brainy), you must pass a matchingsubtype. Runawait brain.audit()to inventory pre-existing gaps before enabling strict mode; see the migration recipe.
Returns: Promise<string> - Entity ID
`get(id)` → `Promise`
Retrieve a single entity by ID.
const entity = await brain.get(id)
console.log(entity?.data) // Original data
console.log(entity?.metadata) // Metadata
console.log(entity?.vector) // Embedding vectorParameters:
id:string- Entity ID
Returns: Promise<Entity | null> - Entity or null if not found
`update(params)` → `Promise`
Update an existing entity.
await brain.update({
id: entityId,
data: 'Updated content', // Optional: new data
subtype: 'archived', // Optional: change sub-classification
metadata: { updated: true } // Optional: new metadata (merges)
})Parameters:
id:string- Entity IDdata?:string | number[]- New data/vectortype?:NounType- Change entity typesubtype?:string- Change subtype (omit to preserve existing)metadata?:object- Metadata to merge (or replace withmerge: false)confidence?:number- Update classification confidenceweight?:number- Update entity importanceifRev?:number- Optimistic-concurrency check. When provided, the update throwsRevisionConflictErrorif the persisted entity's_revno longer equalsifRev. See guides/optimistic-concurrency.
Returns: Promise<void>
Tip — read-then-CAS. Every entity returned by
get()/find()/search()carriesentity._rev(a monotonic counter Brainy auto-bumps on every successfulupdate()). Pass it back asifRevto make multi-writer coordination safe without an external lock service. Full guide: guides/optimistic-concurrency.
`remove(id)` → `Promise`
Remove a single entity (and every relationship where it is source or target).
await brain.remove(id)Parameters:
id:string- Entity ID
Returns: Promise<void>
Search & Query
`find(query)` → `Promise`
Triple Intelligence - Vector + Graph + Metadata in ONE query.
// Simple text search
const results = await brain.find('machine learning')
// Advanced Triple Intelligence query
const results = await brain.find({
query: 'artificial intelligence', // Vector similarity
where: { // Metadata filtering
year: { greaterThan: 2020 },
category: { oneOf: ['AI', 'ML'] }
},
connected: { // Graph traversal
to: conceptId,
depth: 2,
type: VerbType.RelatedTo
},
limit: 10
})Parameters:
query:string | FindParams- Simple: Just text for vector search
- Advanced: Object with vector + graph + metadata filters
FindParams:
query?:string- Text for semantic + hybrid search (searchesdatavia the vector index + text index)type?:NounType | NounType[]- Filter by entity type(s). Alias forwhere.noun.subtype?:string | string[]- Filter by sub-classification (top-level standard field, fast path). Single string for equality, array for set membership.where?:object- Metadata filters. See Query Operators for all operators.connected?:object- Graph traversal optionsto?:string- Target entity IDfrom?:string- Source entity IDvia?:VerbType | VerbType[]- Relationship type(s) to traversetype?:VerbType | VerbType[]- Alias forviadepth?:number- Traversal depth (default: 1)direction?:'in' | 'out' | 'both'- Traversal direction (default: 'both')limit?:number- Max results (default: 10)offset?:number- Skip resultsorderBy?:string- Field to sort by (e.g., 'createdAt', 'metadata.priority')order?:'asc' | 'desc'- Sort direction (default: 'asc')searchMode?:'auto' | 'text' | 'semantic' | 'hybrid'- Search strategy:'auto'(default): Zero-config hybrid combining text + semantic search'text': Pure keyword/text matching'semantic'/'vector': Pure vector similarity'hybrid': Explicit hybrid modehybridAlpha?:number- Balance between text (0.0) and semantic (1.0) search. Auto-detected by query length if not specified.excludeVFS?:boolean- Exclude VFS entities from results (default: false)
limittip: Brainy capslimitagainst an auto-configured maximum (based on container/free memory, ~25 KB per result). Above the cap you get a one-time warning per call site; above 2× the cap it throws. To raise the cap, passnew Brainy({ maxQueryLimit: N })or{ reservedQueryMemory: bytes }. For queries that need ALL matches, paginate with{ limit, offset }— that's the only pattern guaranteed to keep working across Brainy versions. See Query Limits & Pagination.
Returns: Promise<Result[]> - Matching entities with scores
Hybrid Search
Brainy automatically combines text (keyword) and semantic (vector) search for optimal results. No configuration needed.
// Zero-config hybrid search (just works)
const results = await brain.find({
query: 'David Smith' // Finds both exact text matches AND semantically similar
})
// Force text-only search (exact keyword matching)
const textResults = await brain.find({
query: 'exact keyword',
searchMode: 'text'
})
// Force semantic-only search (vector similarity)
const semanticResults = await brain.find({
query: 'artificial intelligence concepts',
searchMode: 'semantic'
})
// Custom hybrid weighting (0 = text only, 1 = semantic only)
const customResults = await brain.find({
query: 'David Smith',
hybridAlpha: 0.3 // Favor text matching
})How it works:
- Short queries (1-2 words) automatically favor text matching
- Long queries (5+ words) automatically favor semantic search
- Results are combined using Reciprocal Rank Fusion (RRF)
Match Visibility
Search results include detailed match information:
const results = await brain.find({ query: 'david the warrior' })
// Each result now includes:
results[0].textMatches // ["david", "warrior"] - exact query words found
results[0].textScore // 0.25 - text match quality (0-1)
results[0].semanticScore // 0.87 - semantic similarity (0-1)
results[0].matchSource // 'both' | 'text' | 'semantic'Use cases:
- Highlight exact matches in UI (textMatches)
- Explain why a result ranked high (matchSource)
- Debug search behavior (separate scores)
`highlight(params)` → `Promise` ✨
Zero-config highlighting for both exact matches AND semantic concepts. Handles plain text, rich-text JSON (TipTap, Slate, Lexical, Draft.js, Quill), HTML, and Markdown automatically.
// Plain text (works as before)
const highlights = await brain.highlight({
query: "david the warrior",
text: "David Smith is a brave fighter who battles dragons"
})
// [
// { text: "David", score: 1.0, position: [0, 5], matchType: 'text' },
// { text: "fighter", score: 0.78, position: [25, 32], matchType: 'semantic' },
// { text: "battles", score: 0.72, position: [37, 44], matchType: 'semantic' }
// ]
// Rich-text JSON (auto-detected)
const highlights = await brain.highlight({
query: "david the warrior",
text: JSON.stringify(tiptapDocument) // TipTap, Slate, Lexical, Draft.js, Quill
})
// Extracts text from nodes, annotates with contentCategory:
// [
// { text: "David", score: 1.0, matchType: 'text', contentCategory: 'title' },
// { text: "fighter", score: 0.78, matchType: 'semantic', contentCategory: 'content' }
// ]
// HTML input (auto-detected)
const highlights = await brain.highlight({
query: "warrior",
text: "<h1>David the Warrior</h1><p>A brave fighter.</p>"
})
// Custom extractor for proprietary formats
const highlights = await brain.highlight({
query: "function",
text: sourceCode,
contentExtractor: (text) => treeSitterParse(text) // Your custom parser
})Parameters:
query:string- The search querytext:string- Text to highlight (plain text, JSON, HTML, or Markdown)granularity?:'word' | 'phrase' | 'sentence'- Highlight unit (default: 'word')threshold?:number- Min similarity for semantic matches (default: 0.5)contentType?:ContentType- Optional hint:'plaintext' | 'richtext-json' | 'html' | 'markdown'. Skips auto-detection when provided.contentExtractor?:(text: string) => ExtractedSegment[]- Custom parser. Bypasses built-in detection entirely.
Returns: Promise<Highlight[]>
text- The matched textscore- Match score (1.0 for text matches, varies for semantic)position- [start, end] indices in extracted textmatchType-'text'(exact) or'semantic'(concept)contentCategory?-'title' | 'annotation' | 'content' | 'value' | 'code' | 'structural'— Role of the source text. Built-in extractors produce'title','content','code'. All 6 categories are available for custom parsers.
Supported Rich-Text Formats:
| Format | Detection | Text nodes |
|---|---|---|
| TipTap / ProseMirror | { type: 'doc', content: [...] } |
{ type: 'text', text } |
| Slate.js | [{ type, children }] |
{ text } |
| Lexical | { root: { children } } |
{ type: 'text', text } |
| Draft.js | { blocks: [{ text }] } |
{ text } in block |
| Quill Delta | { ops: [{ insert }] } |
{ insert } |
| HTML | Tags like <h1>, <p>, <code> |
Visible text content |
| Markdown | # headings, ``` code blocks |
Stripped markup |
Timeout Protection:
Semantic matching has a 10-second timeout. If embedding takes too long (e.g., WASM stall), highlight() returns text-only matches instead of hanging.
UI Pattern:
// Style differently based on match type and content category
highlights.forEach(h => {
const style = h.matchType === 'text' ? 'font-weight: bold' : 'background: yellow'
if (h.contentCategory === 'title') { /* render as heading highlight */ }
if (h.contentCategory === 'code') { /* render with code styling */ }
if (h.contentCategory === 'annotation') { /* render as comment/caption */ }
// Apply style from h.position[0] to h.position[1]
})Query Operators
Brainy uses clean, readable operators (BFO — Brainy Field Operators):
| Operator | Description | Example |
|---|---|---|
equals / eq |
Exact match | {age: {equals: 25}} |
notEquals / ne |
Not equal | {status: {notEquals: 'deleted'}} |
greaterThan / gt |
Greater than | {age: {greaterThan: 18}} |
greaterEqual / gte |
Greater or equal | {score: {greaterEqual: 90}} |
lessThan / lt |
Less than | {price: {lessThan: 100}} |
lessEqual / lte |
Less or equal | {rating: {lessEqual: 3}} |
between |
Inclusive range | {year: {between: [2020, 2025]}} |
oneOf / in |
In array | {color: {oneOf: ['red', 'blue']}} |
noneOf |
Not in array | {status: {noneOf: ['deleted']}} |
contains |
Array contains value | {tags: {contains: 'ai'}} |
exists / missing |
Field existence | {email: {exists: true}} |
startsWith |
String prefix | {name: {startsWith: 'John'}} |
endsWith |
String suffix | {email: {endsWith: '@gmail.com'}} |
matches |
Pattern match | {text: {matches: /^[A-Z]/}} |
allOf |
AND combinator | {allOf: [{active: true}, {role: 'admin'}]} |
anyOf |
OR combinator | {anyOf: [{role: 'admin'}, {role: 'owner'}]} |
Complete Operator Reference → — all operators, aliases, indexed vs in-memory support matrix, and practical examples.
Aggregation Engine
Brainy's aggregation engine maintains incremental running totals at write time, delivering O(1) aggregate reads regardless of dataset size. Define aggregates once, and every add(), update(), and delete() automatically updates the running metrics.
`defineAggregate(definition)` → `void`
Register a named aggregate for incremental computation.
brain.defineAggregate({
name: 'monthly_spending',
source: {
type: NounType.Event,
where: { domain: 'financial' } // matches custom metadata fields
},
groupBy: [
'category',
{ field: 'date', window: 'month' } // Time-windowed dimension
],
metrics: {
total: { op: 'sum', field: 'amount' },
count: { op: 'count' },
average: { op: 'avg', field: 'amount' },
highest: { op: 'max', field: 'amount' },
lowest: { op: 'min', field: 'amount' },
spread: { op: 'stddev', field: 'amount' } // Welford's online algorithm
},
materialize: true // Optional: write results as NounType.Measurement entities
})Parameters:
| Field | Type | Description |
|---|---|---|
name |
string |
Unique identifier for this aggregate |
source.type |
NounType | NounType[] |
Entity types that feed into this aggregate |
source.where |
Record<string, unknown> |
Filter on custom metadata fields (matched against the entity's metadata bag) |
source.service |
string |
Multi-tenancy filter |
groupBy |
GroupByDimension[] |
Dimensions to group by — plain field names, { field, window } for time bucketing, or { field, unnest: true } for array fields (one contribution per element) |
metrics |
Record<string, AggregateMetricDef> |
Named metrics with op (sum, count, avg, min, max, stddev, variance, percentile, distinctCount) and optional field. percentile additionally requires p in [0, 1]. |
materialize |
boolean | object |
Write results as NounType.Measurement entities (auto-visible in OData/Sheets/SSE) |
Time window granularities: 'hour', 'day', 'week', 'month', 'quarter', 'year', or { seconds: number } for custom intervals.
`removeAggregate(name)` → `void`
Remove a named aggregate and clean up its state.
brain.removeAggregate('monthly_spending')Querying Aggregates via `find()`
Aggregate results are queried through the standard find() method using the aggregate parameter:
// Simple: query by name
const results = await brain.find({ aggregate: 'monthly_spending' })
// With filtering on group keys
const foodOnly = await brain.find({
aggregate: 'monthly_spending',
where: { category: 'food' }
})
// With sorting and pagination
const topCategories = await brain.find({
aggregate: {
name: 'monthly_spending',
orderBy: 'total',
order: 'desc',
limit: 10
}
})
// Combine find-level params (where, orderBy, limit, offset merge automatically)
const recentFood = await brain.find({
aggregate: 'monthly_spending',
where: { category: 'food' },
orderBy: 'total',
order: 'desc',
limit: 12
})Result format: Returns Result<T>[] with type: NounType.Measurement. Each result contains:
{
id: string, // Aggregate group ID (or materialized entity ID)
score: 1.0, // Always 1.0 for aggregates
type: NounType.Measurement,
metadata: {
__aggregate: 'monthly_spending', // Source aggregate name
category: 'food', // Group key values
date: '2024-01', // Time window bucket
total: 342.50, // Computed metrics
count: 28,
average: 12.23,
highest: 45.00,
lowest: 2.50
},
entity: Entity // Full entity structure
}`queryAggregate(name, params?)` → `Promise`
The first-class analytics path — returns plain group rows ({ groupKey, metrics, count }) instead of find()-style Result wrappers. Supports where (group-key filter), having (SQL-HAVING metric filter), orderBy, order, limit, offset:
const rows = await brain.queryAggregate('monthly_spending', {
having: { total: { greaterThan: 100 } }, // filter by computed metrics
orderBy: 'total',
order: 'desc',
limit: 10
})
// [{ groupKey: { category: 'food', date: '2024-01' }, metrics: { total: 342.5, count: 28 }, count: 28 }, ...]How It Works
Aggregation hooks run outside transactions on every write operation:
add(): If the new entity matches any aggregate'ssourcefilter, its values are added to the matching group's running totals.update(): The old entity's contribution is reversed and the new entity's contribution is applied (handles group key changes, source filter changes).delete(): The deleted entity's contribution is reversed from its group.
Performance: O(A × G × M) per write where A = matching aggregates, G = groupBy dimensions, M = metrics. For typical configurations (2-5 aggregates, 1-3 dimensions, 3-5 metrics), this is effectively O(1) — measured at 10,000 entities in 13ms in unit tests.
Infinite loop prevention: Materialized NounType.Measurement entities (with service: 'brainy:aggregation' or metadata.__aggregate) are automatically excluded from all aggregate source matching.
Persistence: Definitions and running state are persisted to storage on flush()/close() and reloaded on init(). Definition changes are detected via FNV-1a hashing — only changed aggregates reset their state.
Native acceleration: Register an 'aggregation' provider via the plugin system to replace the TypeScript engine with a custom native implementation for higher throughput at scale.
Financial Data Modeling
Brainy supports financial analytics through subtypes and metadata conventions on existing NounTypes — no custom types needed:
// Transaction = NounType.Event + 'transaction' subtype + financial metadata
await brain.add({
data: 'Coffee at Blue Bottle',
type: NounType.Event,
subtype: 'transaction', // top-level standard field (reserved — never in metadata)
metadata: {
domain: 'financial',
amount: 5.50,
currency: 'USD',
category: 'food',
date: Date.now(),
merchant: 'Blue Bottle Coffee'
}
})
// Account = NounType.Collection + 'account' subtype + financial metadata
await brain.add({
data: 'Checking Account',
type: NounType.Collection,
subtype: 'account',
metadata: {
domain: 'financial',
accountType: 'checking',
currency: 'USD',
institution: 'Chase'
}
})
// Invoice = NounType.Document + 'invoice' subtype + financial metadata
await brain.add({
data: 'Invoice #1234 from Acme Corp',
type: NounType.Document,
subtype: 'invoice',
metadata: {
domain: 'financial',
amount: 15000,
currency: 'USD',
status: 'pending',
dueDate: Date.UTC(2024, 2, 15),
vendor: 'Acme Corp'
}
})Relationships
`relate(params)` → `Promise`
Create a typed relationship between entities.
const relId = await brain.relate({
from: sourceId,
to: targetId,
type: VerbType.ReportsTo,
subtype: 'direct', // Optional: sub-classification
data: 'Collaborated on the research paper', // Optional: content for this edge
metadata: { // Optional: structured edge fields
strength: 0.9,
role: 'primary author'
}
})Parameters:
from:string- Source entity ID (must exist)to:string- Target entity ID (must exist)type:VerbType- Relationship typesubtype?:string- Per-product sub-classification within the VerbType (top-level standard field, fast-path indexed). See Subtypes & Facets.data?:any- Content for the relationship (overrides auto-computed vector)metadata?:object- Structured edge fieldsweight?:number- Connection strength (0-1, default: 1.0)bidirectional?:boolean- Create reverse edge too (default: false)confidence?:number- Relationship certainty (0-1)
Strict-mode tip: same as
add()— if a vocabulary is registered for yourtype, pass a matchingsubtype. Runawait brain.audit()first to surface pre-existing gaps.
Returns: Promise<string> - Relationship ID
`updateRelation(params)` → `Promise`
Update an existing relationship. Mirror of update() for verbs — closed a long-standing gap (verbs had no update path before 7.30).
// Change the subtype on an existing relationship
await brain.updateRelation({ id: relId, subtype: 'dotted-line' })
// Update weight + confidence
await brain.updateRelation({ id: relId, weight: 0.7, confidence: 0.9 })
// Change verb type (re-indexes in graph adjacency, id preserved)
await brain.updateRelation({ id: relId, type: VerbType.WorksWith })Parameters:
id:string- Relationship ID (required)type?:VerbType- Change verb type (re-indexes in graph adjacency)subtype?:string- Change sub-classification (omit to preserve existing)weight?:number- New weight (0-1)confidence?:number- New confidence (0-1)data?:any- New contentmetadata?:object- Metadata to merge (or replace withmerge: false)merge?:boolean- Merge or replace metadata (default: true)
Returns: Promise<void>
`related(params)` → `Promise`
Get relationships for an entity. Same name and surface as db.related() on a
pinned Db view.
// Get all relationships FROM an entity
const outgoing = await brain.related({ from: entityId })
// Get all relationships TO an entity
const incoming = await brain.related({ to: entityId })
// Filter by type
const contains = await brain.related({
from: entityId,
type: VerbType.Contains
})
// Filter by subtype (fast path, column-store hit)
const direct = await brain.related({
from: entityId,
type: VerbType.ReportsTo,
subtype: 'direct'
})
// Set membership on subtype
const all = await brain.related({
from: entityId,
type: VerbType.ReportsTo,
subtype: ['direct', 'dotted-line']
})Parameters:
from?:string- Source entity IDto?:string- Target entity IDtype?:VerbType | VerbType[]- Filter by relationship typesubtype?:string | string[]- Filter by VerbType subtype (top-level standard field, fast path)service?:string- Multi-tenancy filterlimit?:number- Pagination limit (default: 100)offset?:number- Pagination offset
Returns: Promise<Relation[]> - Matching relationships (each with subtype at top level when set)
Batch Operations
`addMany(params)` → `Promise>`
Add multiple entities in one operation.
const result = await brain.addMany({
items: [
{ data: 'Entity 1', type: NounType.Document },
{ data: 'Entity 2', type: NounType.Concept }
]
})
console.log(result.successful) // Array of IDs
console.log(result.failed) // Array of errorsReturns: Promise<BatchResult<string>> - Success/failure results
`removeMany(params)` → `Promise>`
Remove multiple entities.
const result = await brain.removeMany({
ids: [id1, id2, id3]
})`updateMany(params)` → `Promise>`
Update multiple entities.
const result = await brain.updateMany({
items: [
{ id: id1, metadata: { updated: true } },
{ id: id2, data: 'New content' }
]
})`relateMany(params)` → `Promise`
Create multiple relationships.
const ids = await brain.relateMany({
items: [
{ from: id1, to: id2, type: VerbType.RelatedTo },
{ from: id1, to: id3, type: VerbType.Contains }
]
})Database Values & Time Travel (Db API)
Brainy 8.0's generational MVCC exposes the whole store as an immutable
value: the Db. Pin the current state in O(1), commit atomic
multi-write batches, query any past generation with the full query surface,
cut instant snapshots, and ask what-if questions in memory. The exact
guarantees live in the consistency model;
recipes live in Snapshots & Time Travel.
`generation()` → `number`
The store's current generation — a monotonic watermark advanced once per
committed transact() batch and once per single-operation write. Never
reissued, including across restarts and restore().
const g = brain.generation()`now()` → `Db`
Pin the current generation and return an immutable view — O(1), no I/O. The view keeps reading exactly this state no matter what commits afterwards.
const db = brain.now()
await brain.update({ id, metadata: { v: 2 } })
await db.get(id) // still sees v: 1 — pinned
await brain.get(id) // sees v: 2 — live
await db.release() // unpin (enables history compaction)Returns: Db — release it when done; pins gate compactHistory().
`transact(ops, options?)` → `Promise`
Execute a declarative operation batch atomically: either every operation applies and the store advances exactly one generation, or none apply and the store is byte-identical to its pre-transaction state. The commit point is an atomic manifest rename; a crash anywhere before it rolls back to the exact pre-transaction bytes on the next open.
const db = await brain.transact([
{ op: 'add', id: orderId, type: NounType.Document, subtype: 'order', data: 'Order #1042' },
{ op: 'update', id: customerId, metadata: { lastOrderAt: Date.now() }, ifRev: customer._rev },
{ op: 'relate', from: customerId, to: orderId, type: VerbType.Creates, subtype: 'purchase' },
{ op: 'remove', id: staleDraftId },
{ op: 'unrelate', id: oldRelationId }
], {
meta: { author: 'order-service', requestId: 'req-9f2' }, // reified, durable
ifAtGeneration: expectedGeneration // whole-store CAS
})
db.receipt.ids // resolved id per operation, in input order
db.receipt.generation // the committed generationOperations (op discriminates; parameters mirror the single-operation methods):
{ op: 'add', ... }— same parameters asadd(); optional explicitid{ op: 'update', ... }— same parameters asupdate(), including per-entityifRevCAS{ op: 'remove', id }— deletes the entity plus its relationships (same cascade asdelete()){ op: 'relate', ... }— same parameters asrelate(), includingbidirectional; duplicates dedupe to the existing relationship id{ op: 'unrelate', id }— deletes a relationship by id
Operations may reference ids created earlier in the same batch.
Options:
meta?:Record<string, unknown>— transaction metadata, recorded durably in the transaction log (audit fields: author, reason, request id)ifAtGeneration?:number— whole-store compare-and-swap; commits only if the store is still at this generation
Returns: Promise<Db> — pinned at the freshly committed generation, carrying a receipt.
Throws:
GenerationConflictError—ifAtGenerationdid not match (nothing staged, generation unchanged)RevisionConflictError— anifRevdid not match (whole batch rejected)
`asOf(target)` → `Promise`
Open an immutable view of past state:
const atGen = await brain.asOf(1041) // generation number
const lastWeek = await brain.asOf(new Date(Date.now() - 7 * 86_400_000)) // wall-clock
const fromSnapshot = await brain.asOf('/backups/2026-06-01') // snapshot directorynumber— pins that generation; reads resolve through the immutable record layer.Date— resolved via the transaction log to the newest generation committed at or before it.string— a snapshot directory fromdb.persist(), opened as a self-contained read-only store (equivalent toBrainy.load()).
Historical views serve the full query surface. Metadata-level reads are
free; the first index-accelerated query (semantic search, traversal,
cursors, aggregation) builds an in-memory index materialization — O(n at
that generation), once per Db, freed on release().
Throws: GenerationCompactedError when the generation's records were reclaimed by compactHistory().
History granularity: only transact() batches produce historical
records; single-operation writes advance the clock but stay visible through
earlier pins. See the consistency model.
`transactionLog(options?)` → `Promise`
Read the reified transaction log — one entry per committed transact()
batch, newest first: { generation, timestamp, meta? }.
const [latest] = await brain.transactionLog({ limit: 1 })
latest.meta // { author: 'order-service', requestId: 'req-9f2' }`compactHistory(options?)` → `Promise`
Reclaim historical record-sets that no retention rule and no live Db pin
protects. Pinned reads stay correct across compaction, always.
await brain.compactHistory({
retainGenerations: 100, // keep the 100 most recent commits
retainMs: 7 * 24 * 60 * 60 * 1000 // and everything from the last 7 days
})Returns: { removedGenerations, horizon } — asOf() below the horizon throws GenerationCompactedError.
`restore(path, { confirm: true })` → `Promise`
Replace the store's entire state from a snapshot directory. Destructive
— requires { confirm: true }. All indexes are rebuilt; the generation
counter is floored so observed generation numbers are never reissued; live
pins do not survive.
await brain.restore('/backups/2026-06-01', { confirm: true })`Brainy.load(path)` → `Promise` (static)
Open a persisted snapshot as a self-contained read-only store with the
full query surface, including vector search. Releasing the returned Db
closes the underlying instance.
const db = await Brainy.load('/backups/2026-06-01')
const hits = await db.search('quarterly invoices')
await db.release()The `Db` value
Every Db is pinned at one generation and serves the full query surface at
exactly that state.
Properties:
| Property | Type | Meaning |
|---|---|---|
generation |
number |
The pinned generation |
timestamp |
number |
Pin time (now()), commit time (transact()), or resolved commit time (asOf()) |
receipt |
TransactReceipt? |
Present only on transact() results |
speculative |
boolean |
Whether this view carries a with() overlay |
released |
boolean |
Whether release() has been called |
Methods:
await db.get(id) // entity as of this generation
await db.find({ where: { status: 'open' } }) // full find() surface
await db.search('unpaid invoices') // semantic search as of this generation
await db.related(entityId) // relationships as of this generation
await db.since(olderDb) // ids changed between two views
const whatIf = await db.with(ops) // speculative in-memory overlay
await db.persist('/backups/today') // self-contained hard-link snapshot
await db.release() // unpin + free cached materializationwith(ops)— appliestransact()-style operations in memory on top of the view; nothing touches disk, the generation counter, or index providers. Overlay entities carry no embeddings, so index-accelerated queries andpersist()on overlays throwSpeculativeOverlayError;get(), metadata-filterfind(), and filter-basedrelated()work fully. Commit the same ops withtransact()for the full surface.persist(path)— cuts an instant snapshot (hard links on filesystem storage; byte copies across devices; in-memory stores serialize to the same layout). Requires the view to still be the store's latest generation — otherwiseGenerationConflictError.release()— idempotent; after release every read throws. AFinalizationRegistrybackstop releases leaked pins at GC, but explicit release is what makescompactHistory()deterministic.
Db API errors
All exported from @soulcraft/brainy:
| Error | Thrown by | Meaning |
|---|---|---|
GenerationConflictError |
transact({ ifAtGeneration }), db.persist() |
The store moved past the expected generation — re-read and retry |
RevisionConflictError |
update({ ifRev }), transact() update ops |
Per-entity revision moved — see optimistic concurrency |
GenerationCompactedError |
asOf() |
The requested generation's records were reclaimed — persist what you must keep |
SpeculativeOverlayError |
index-accelerated reads / persist() on with() overlays |
Honest boundary: overlay entities carry no embeddings |
Virtual Filesystem (VFS)
Access via brain.vfs (property, not method). Auto-initialized during brain.init().
Filtering VFS Entities
All VFS entities (files/folders) have metadata.isVFSEntity: true set automatically.
Use this to filter VFS entities from semantic search results:
// Exclude VFS entities from semantic search
const semanticOnly = await brain.find({
query: 'artificial intelligence',
where: {
isVFSEntity: { notEquals: true } // Only semantic entities
}
})
// Or filter to ONLY VFS entities
const vfsOnly = await brain.find({
where: {
isVFSEntity: { equals: true } // Only VFS files/folders
}
})
// Check if an entity is a VFS entity
if (entity.metadata.isVFSEntity === true) {
console.log('This is a VFS file or folder')
}Why this matters: Without filtering, VFS files/folders can appear in concept explorers and semantic search results where they don't belong.
Basic File Operations
`vfs.readFile(path, options?)` → `Promise`
Read file content.
const content = await brain.vfs.readFile('/docs/README.md')
console.log(content.toString())`vfs.writeFile(path, data, options?)` → `Promise`
Write file content.
await brain.vfs.writeFile('/docs/README.md', 'New content', {
encoding: 'utf-8'
})`vfs.unlink(path)` → `Promise`
Delete a file.
await brain.vfs.unlink('/docs/old-file.md')Directory Operations
`vfs.mkdir(path, options?)` → `Promise`
Create directory.
await brain.vfs.mkdir('/projects/new-app', { recursive: true })`vfs.readdir(path, options?)` → `Promise`
List directory contents.
const files = await brain.vfs.readdir('/projects')
// With file types
const entries = await brain.vfs.readdir('/projects', { withFileTypes: true })
entries.forEach(entry => {
console.log(entry.name, entry.isDirectory() ? 'DIR' : 'FILE')
})`vfs.rmdir(path, options?)` → `Promise`
Remove directory.
await brain.vfs.rmdir('/old-project', { recursive: true })`vfs.stat(path)` → `Promise`
Get file/directory stats.
const stats = await brain.vfs.stat('/docs/README.md')
console.log(stats.size) // File size
console.log(stats.mtime) // Modified time
console.log(stats.isDirectory()) // Is directory?Semantic Operations
`vfs.search(query, options?)` → `Promise`
Semantic file search.
const results = await brain.vfs.search('React components with hooks', {
path: '/src',
limit: 10
})`vfs.findSimilar(path, options?)` → `Promise`
Find similar files.
const similar = await brain.vfs.findSimilar('/src/App.tsx', {
limit: 5,
threshold: 0.7
})Tree Operations
`vfs.getTreeStructure(path, options?)` → `Promise`
Get directory tree (prevents infinite recursion).
const tree = await brain.vfs.getTreeStructure('/projects', {
maxDepth: 3
})`vfs.getDescendants(path, options?)` → `Promise`
Get all descendants with optional filtering.
const files = await brain.vfs.getDescendants('/src', {
filter: (entity) => entity.name.endsWith('.tsx')
})Metadata & Relationships
`vfs.getMetadata(path)` → `Promise`
Get file metadata.
const meta = await brain.vfs.getMetadata('/src/App.tsx')
console.log(meta.todos) // Extracted TODOs
console.log(meta.tags) // Tags`vfs.getRelationships(path)` → `Promise`
Get file relationships.
const rels = await brain.vfs.getRelationships('/src/App.tsx')
// Returns: imports, references, dependencies`vfs.getTodos(path)` → `Promise`
Get TODOs from a file.
const todos = await brain.vfs.getTodos('/src/App.tsx')`vfs.searchEntities(query)` → `Promise>`
Search for semantic entities tracked by the VFS, filtered by type, name, or metadata.
const people = await brain.vfs.searchEntities({
type: 'person', // entity type filter
name: 'Ada', // semantic name search
where: { role: 'author' }, // metadata filters
limit: 50
})📖 Complete VFS Documentation →
Neural API
Access advanced AI features via brain.neural() (method that returns NeuralAPI instance).
`neural().similar(a, b, options?)` → `Promise`
Calculate semantic similarity.
// Simple similarity score
const score = await brain.neural().similar(
'renewable energy',
'sustainable power'
) // 0.87
// Detailed result
const result = await brain.neural().similar('text1', 'text2', {
detailed: true
})
console.log(result.score)
console.log(result.explanation)`neural().clusters(input?, options?)` → `Promise`
Automatic clustering. Accepts a text query, an array of texts, or a ClusteringOptions object.
const clusters = await brain.neural().clusters({
algorithm: 'kmeans', // 'auto' | 'hierarchical' | 'kmeans' | 'dbscan' | ...
maxClusters: 5,
minClusterSize: 3
})
clusters.forEach(cluster => {
console.log(cluster.label) // auto-generated label (when available)
console.log(cluster.members) // entity ids in the cluster
console.log(cluster.centroid) // centroid vector
})`neural().neighbors(id, options?)` → `Promise`
Find nearest neighbors.
const result = await brain.neural().neighbors(entityId, {
limit: 10,
minSimilarity: 0.7
})Options: limit?, radius?, minSimilarity?, includeMetadata?, sortBy?: 'similarity' | 'importance' | 'recency'
`neural().outliers(options?)` → `Promise`
Detect outlier entities.
const outliers = await brain.neural().outliers({ threshold: 0.3 })
// Each outlier carries the entity id plus outlier scoring detailsOptions: threshold?, method?: 'isolation' | 'statistical' | 'cluster-based', minNeighbors?, includeReasons?
`neural().visualize(options?)` → `Promise`
Generate visualization data.
const vizData = await brain.neural().visualize({
maxNodes: 100,
dimensions: 3,
algorithm: 'force',
includeEdges: true
})
// Use with D3.js, Cytoscape, GraphML toolsPerformance Methods
`neural().clusterFast(options?)` → `Promise`
Fast clustering for large datasets.
const clusters = await brain.neural().clusterFast({
maxClusters: 10
})Options: level? (hierarchy level), maxClusters?
`neural().clusterLarge(options?)` → `Promise`
Sampling-based clustering for very large datasets.
const clusters = await brain.neural().clusterLarge({
sampleSize: 1000,
strategy: 'diverse' // 'random' | 'diverse' | 'recent'
})Import & Export
`import(source, options?)` → `Promise`
Smart import with auto-detection (CSV, Excel, PDF, JSON, URLs).
// CSV import
await brain.import('data.csv', {
format: 'csv',
createEntities: true
})
// Excel import (all sheets processed automatically)
await brain.import('sales.xlsx', {
format: 'excel',
vfsPath: '/imports/sales', // optional: mirror into the VFS
groupBy: 'sheet'
})
// PDF import (tables extracted automatically)
await brain.import('research.pdf', { format: 'pdf' })
// URL import
await brain.import('https://api.example.com/data.json')Parameters:
source:string | Buffer | object- File path, URL, buffer, or objectoptions?: Import configurationformat?:'excel' | 'pdf' | 'csv' | 'json' | 'markdown' | 'yaml' | 'docx' | 'image'- Auto-detected if omittedvfsPath?:string- Mirror imported content into the VFS at this pathgroupBy?:'type' | 'sheet' | 'flat' | 'custom'- VFS grouping strategycreateEntities?:boolean- Create entities from rowscreateRelationships?:boolean- Create relationships between extracted entitiespreserveSource?:boolean- Save the original file in the VFSenableNeuralExtraction?:boolean- Extract entity names via AIenableRelationshipInference?:boolean- Infer relationships via AIenableConceptExtraction?:boolean- Extract entity types via AIconfidenceThreshold?:number- Minimum confidence for extracted entitiesonProgress?:(progress) => void- Progress callback (stage, counts, throughput, ETA)
Returns: Promise<ImportResult> - Import statistics
Export & Import (portable) + Snapshots (native)
Portable graph export/import — brain.export() / brain.import() (PortableGraph v1, versioned
JSON, partial-or-whole, cross-version). export() lives on the immutable Db, so it composes
with now()/asOf()/with():
// Export part or all of the brain to a portable, versioned document
const graph = await brain.export({ ids }, { includeVectors: true })
// Restore it — import() routes a PortableGraph to the graph round-trip (merge by id)
await otherBrain.import(graph, { onConflict: 'merge' })
// Time-travel export (serialize a past generation) / what-if export (a speculative state)
const past = await brain.asOf(gen)
const asWas = await past.export({ collection: id })
await past.release()
await brain.now().with(ops).export({ ids })Selectors: { ids }, { collection } (alias memberOf), { connected: { from, depth } },
{ vfsPath }, predicate ({ type, subtype, where, service }), or whole brain (omit). See the
Export & Import guide. Distinct from brain.import(file)
(CSV/PDF/Excel/JSON ingestion — import() dispatches on whether you pass a PortableGraph or a file).
Native whole-brain snapshot (generation-preserving, not portable JSON):
// Instant hard-link snapshot via the Db API
const pin = brain.now()
await pin.persist('/backups/2026-06-11')
await pin.release()
// Time-travel to a past generation or timestamp
const snapshot = await brain.asOf(new Date('2026-06-01'))
const entities = await snapshot.find({ limit: 100 })
await snapshot.release()Configuration
Constructor Options
const brain = new Brainy({
// Storage configuration
storage: {
type: 'filesystem', // 'memory' | 'filesystem' | 'auto'
rootDirectory: './brainy-data'
},
// Vector index configuration (2 knobs)
vector: {
recall: 'balanced', // 'fast' | 'balanced' | 'accurate'
persistMode: 'immediate' // 'immediate' | 'deferred'
},
// Model configuration (embedded in WASM - zero config needed)
// Model: all-MiniLM-L6-v2 (384 dimensions)
// Device: CPU via WASM (works everywhere)
// Cache configuration — `true`/`false`, or an options object
cache: {
maxSize: 10000,
ttl: 3600000 // 1 hour in ms
}
})
await brain.init() // Required! VFS auto-initializedStorage Adapters
Brainy 8.0 ships two adapters — both support the full Db API (generational history, snapshots, restore).
Memory (Default for Tests)
const brain = new Brainy({
storage: { type: 'memory' }
})Use case: Development, testing, prototyping
Filesystem (Default for Node)
const brain = new Brainy({
storage: {
type: 'filesystem',
rootDirectory: './brainy-data'
}
})Use case: Node.js applications, single-node production deployments
For off-site backup, snapshot rootDirectory from your scheduler (gsutil rsync, aws s3 sync, rclone, or tar) — Brainy itself doesn't reach out to cloud object stores.
Auto
const brain = new Brainy({
storage: { type: 'auto', rootDirectory: './brainy-data' }
})Picks 'filesystem' on Node with a writable rootDirectory, falls back to 'memory' otherwise.
Utility Methods
`clear()` → `Promise`
Clear all data (entities and relationships).
await brain.clear()`getNounCount()` → `Promise`
Get total entity count.
const count = await brain.getNounCount()`getVerbCount()` → `Promise`
Get total relationship count.
const count = await brain.getVerbCount()Subtype & facet APIs
Full guide: Subtypes & Facets.
`counts.bySubtype(type, subtype?)` → `Record | number`
O(1) subtype counts for a NounType (backed by the persisted rollup).
brain.counts.bySubtype(NounType.Person)
// → { employee: 12, customer: 847, vendor: 34 }
brain.counts.bySubtype(NounType.Person, 'employee')
// → 12`counts.topSubtypes(type, n=10)` → `Array<[subtype, count]>`
Top N subtypes ranked by count.
brain.counts.topSubtypes(NounType.Person, 3)
// → [['customer', 847], ['employee', 12], ['vendor', 34]]`subtypesOf(type)` → `string[]`
Sorted distinct subtypes seen for a NounType.
brain.subtypesOf(NounType.Person)
// → ['customer', 'employee', 'vendor']`counts.byRelationshipSubtype(verb, subtype?)` → `Record | number`
Verb-side mirror of counts.bySubtype. O(1) per-VerbType-per-subtype counts.
brain.counts.byRelationshipSubtype(VerbType.ReportsTo)
// → { direct: 12, 'dotted-line': 3 }
brain.counts.byRelationshipSubtype(VerbType.ReportsTo, 'direct')
// → 12`counts.topRelationshipSubtypes(verb, n=10)` → `Array<[subtype, count]>`
Top N subtypes for a VerbType ranked by count.
brain.counts.topRelationshipSubtypes(VerbType.ReportsTo, 3)
// → [['direct', 12], ['dotted-line', 3]]`relationshipSubtypesOf(verb)` → `string[]`
Sorted distinct subtypes seen for a VerbType.
brain.relationshipSubtypesOf(VerbType.ReportsTo)
// → ['direct', 'dotted-line']`audit(options?)` → `Promise` (7.30.1+)
Diagnostic — find entities and relationships missing a subtype value, grouped by type. The companion to migrateField() / fillSubtypes() — answers "what would break if I enabled strict subtype enforcement?".
const report = await brain.audit()
// {
// entitiesWithoutSubtype: { event: 24, document: 3 },
// relationshipsWithoutSubtype: { relatedTo: 1402 },
// total: 1429,
// scanned: 8400,
// recommendation: 'Found 1429 entries without subtype. ...'
// }Parameters:
options.includeVFS?:boolean— Whenfalse(default), VFS infrastructure entities (metadata.isVFSEntity/metadata.isVFS) are excluded. They bypass enforcement anyway, so counting them is noise.options.batchSize?:number— Pagination batch size (default 200).options.onProgress?:(progress: { scanned, missingSubtype }) => void— Progress callback per batch.
Run before adopting an SDK that registers requireSubtype() rules, or before upgrading to Brainy 8.0 (which makes strict mode the default). See the Strict mode in practice guide for the full migration recipe.
`requireSubtype(type, options?)` → `void`
Register subtype enforcement for a specific NounType or VerbType. Unified API for nouns and verbs. Composes with the brain-wide requireSubtype constructor flag.
// Lock down Person sub-classification
brain.requireSubtype(NounType.Person, {
values: ['employee', 'customer', 'vendor'],
required: true
})
// Lock down management edges
brain.requireSubtype(VerbType.ReportsTo, {
values: ['direct', 'dotted-line'],
required: true
})Parameters:
type:NounType | VerbType- The type to registeroptions.values?:string[]- Vocabulary whitelist (rejects off-vocab values)options.required?:boolean- Whether subtype is required (default:true)
Brain-wide strict mode — `new Brainy({ requireSubtype })`
Constructor option that enforces subtype on every add() / addMany() / update() / relate() / relateMany() / updateRelation() for every type:
// Every write must include subtype
const brain = new Brainy({ requireSubtype: true })
// Exempt specific types (e.g. catch-all Thing)
const brain2 = new Brainy({
requireSubtype: { except: [NounType.Thing, NounType.Custom] }
})When strict mode is on:
- Every public write path checks the pairing guarantee.
addMany()/relateMany()validate all items BEFORE any storage write — atomic-fail, no partial writes.- Brainy's own VFS infrastructure writes bypass via the
metadata.isVFSEntity: truemarker. - Per-type registrations always apply regardless of the brain-wide flag.
The default since 8.0.0 — pass requireSubtype: false to opt out while migrating pre-8.0 data.
`trackField(name, options?)` → `void`
Register a metadata field for cardinality + per-NounType breakdown stats. With values: [...], validates against the whitelist on add()/update().
brain.trackField('status') // basic
brain.trackField('status', { perType: true }) // with per-NounType breakdown
brain.trackField('priority', { values: ['low', 'med', 'high'] }) // strict vocabulary`counts.byField(name, options?)` → `Promise>`
Counts by value for a tracked field. Requires perType: true registration if filtering by NounType.
await brain.counts.byField('status')
// → { todo: 12, doing: 3, done: 47 }
await brain.counts.byField('status', { type: NounType.Task })
// → { todo: 8, doing: 2, done: 30 }`migrateField(options)` → `Promise`
Stream-and-rewrite a field across the brain. Supports metadata.X, data.X, and top-level paths. Idempotent.
// One-shot rewrite
await brain.migrateField({ from: 'metadata.kind', to: 'subtype' })
// Deprecation window — keep source field readable
await brain.migrateField({ from: 'data.kind', to: 'subtype', readBoth: true })
// With progress reporting
await brain.migrateField({
from: 'metadata.kind',
to: 'subtype',
batchSize: 500,
onProgress: ({ scanned, migrated }) => console.log(`${scanned} / ${migrated}`)
})Returns { scanned: number, migrated: number, skipped: number, errors: Array<{id, error}> }.
`fillSubtypes(rules, options?)` → `Promise` (8.0+)
Back-fill missing subtype values across entities AND relationships in one streaming pass — the migration companion to audit(). Keys are NounType/VerbType values; each rule is a literal subtype string or a function deriving one from the entry (return undefined to decline). Idempotent: entries that already carry a subtype are never touched, so a crashed run is resumed safely by re-running.
const report = await brain.fillSubtypes({
[NounType.Person]: (e) => e.metadata?.kind ?? 'unspecified', // derived
[NounType.Document]: 'general', // literal default
[VerbType.RelatedTo]: 'unspecified' // relationship rule
})
// → { scanned, filled, skipped, errors, byType }Parameters:
rules:FillSubtypeRules- Map of NounType/VerbType → literal subtype or(entry) => string | undefinedoptions.includeVFS?:boolean- Also fill VFS infrastructure entries (defaultfalse)options.batchSize?:number- Pagination batch size (default200)options.onProgress?:(progress: { scanned, filled, skipped }) => void- Per-batch callback
Returns { scanned, filled, skipped, errors, byType }. After a clean run, skipped equals the remaining audit().total. See the migration recipe.
`embed(data)` → `Promise` ✨
Generate embedding vector from text or data.
const vector = await brain.embed('Hello world')
// 384-dimensional vector
console.log(vector.length) // 384`embedBatch(texts)` → `Promise` ✨
Batch embed multiple texts using native WASM batch API (single forward pass).
const embeddings = await brain.embedBatch([
'Machine learning is fascinating',
'Deep neural networks',
'Natural language processing'
])
console.log(embeddings.length) // 3
console.log(embeddings[0].length) // 384Uses the WASM engine's native
embed_batch()for a single model forward pass instead of N individual calls. This is the same batch API used internally byhighlight().
`similarity(textA, textB)` → `Promise` ✨
Calculate semantic similarity between two texts.
const score = await brain.similarity(
'The cat sat on the mat',
'A feline was resting on the rug'
)
console.log(score) // ~0.85 (high semantic similarity)Returns: Score from 0 (different) to 1 (identical meaning)
`neighbors(entityId, options?)` → `Promise` ✨
Get graph neighbors of an entity.
// Get all connected entities
const neighbors = await brain.neighbors(entityId)
// Get outgoing connections only
const outgoing = await brain.neighbors(entityId, {
direction: 'outgoing',
limit: 10
})
// Multi-hop traversal
const extended = await brain.neighbors(entityId, {
depth: 2,
direction: 'both'
})Options:
direction:'outgoing' | 'incoming' | 'both'(default: 'both')depth:number- Traversal depth (default: 1)verbType:VerbType- Filter by relationship typelimit:number- Maximum neighbors to return
`findDuplicates(options?)` → `Promise` ✨
Find semantic duplicates in the database.
// Find all duplicates
const duplicates = await brain.findDuplicates()
for (const group of duplicates) {
console.log('Original:', group.entity.id)
for (const dup of group.duplicates) {
console.log(` Duplicate: ${dup.entity.id} (${dup.similarity.toFixed(2)})`)
}
}
// Find person duplicates with higher threshold
const personDupes = await brain.findDuplicates({
type: NounType.Person,
threshold: 0.9,
limit: 50
})Options:
threshold:number- Minimum similarity (default: 0.85)type:NounType- Filter by entity typelimit:number- Maximum duplicate groups (default: 100)
`indexStats()` → `Promise` ✨
Get comprehensive index statistics.
const stats = await brain.indexStats()
console.log(`Entities: ${stats.entities}`)
console.log(`Vectors: ${stats.vectors}`)
console.log(`Relationships: ${stats.relationships}`)
console.log(`Memory: ${(stats.memoryUsage.total / 1024 / 1024).toFixed(1)}MB`)
console.log(`Fields: ${stats.metadataFields.join(', ')}`)Returns:
entities- Total entity countvectors- Total vectors in the vector indexrelationships- Total relationships in graphmetadataFields- Indexed metadata fieldsmemoryUsage.vectors- Vector memory (bytes)memoryUsage.graph- Graph memory (bytes)memoryUsage.metadata- Metadata index memory (bytes)memoryUsage.total- Total memory usage
`cluster(options?)` → `Promise` ✨
Cluster entities by semantic similarity.
// Find all clusters
const clusters = await brain.cluster()
for (const cluster of clusters) {
console.log(`${cluster.clusterId}: ${cluster.entities.length} entities`)
}
// Find document clusters with centroids
const docClusters = await brain.cluster({
type: NounType.Document,
threshold: 0.85,
minClusterSize: 3,
includeCentroid: true
})Options:
threshold:number- Similarity threshold (default: 0.8)type:NounType- Filter by entity typeminClusterSize:number- Minimum cluster size (default: 2)limit:number- Maximum clusters to return (default: 100)includeCentroid:boolean- Calculate cluster centroids (default: false)
Returns:
clusterId- Unique cluster identifierentities- Array of entities in the clustercentroid- Average embedding vector (if includeCentroid is true)
`getStats(options?)` → `Promise`
Get complete entity/relationship statistics (convenience wrapper over brain.counts).
const stats = await brain.getStats()
console.log(stats.entities.total) // total entity count
console.log(stats.entities.byType) // counts per NounType
console.log(stats.relationships) // relationship stats
console.log(stats.density) // relationships per entity
// Exclude VFS infrastructure entities from the counts
const semanticOnly = await brain.getStats({ excludeVFS: true })Lifecycle
Initialization
const brain = new Brainy(config)
await brain.init() // Required! VFS auto-initialized hereVFS is auto-initialized during brain.init() - no separate vfs.init() needed!
Shutdown
await brain.close() // Graceful shutdown — flushes pending writes and releases the writer lockExamples
Basic CRUD
// Create
const id = await brain.add({
data: 'Quantum computing breakthrough',
type: NounType.Concept,
metadata: { category: 'tech', year: 2024 }
})
// Read
const entity = await brain.get(id)
// Update
await brain.update({
id,
metadata: { updated: true }
})
// Remove
await brain.remove(id)Knowledge Graphs
// Create entities
const ai = await brain.add({
data: 'Artificial Intelligence',
type: NounType.Concept
})
const ml = await brain.add({
data: 'Machine Learning',
type: NounType.Concept
})
// Create relationship
await brain.relate({
from: ml,
to: ai,
type: VerbType.IsA
})
// Traverse graph
const results = await brain.find({
connected: { from: ai, depth: 2 }
})Triple Intelligence Query
const results = await brain.find({
query: 'modern frontend frameworks', // 🔍 Vector
where: { // 📊 Document
year: { greaterThan: 2020 },
category: { oneOf: ['framework', 'library'] }
},
connected: { // 🕸️ Graph
to: reactId,
depth: 2,
type: VerbType.BuiltOn
},
limit: 10
})Database-as-a-Value Workflow
// Speculate: what would this change look like? (nothing touches disk)
const base = brain.now()
const whatIf = await base.with([
{ op: 'add', type: NounType.Document, subtype: 'note', data: 'New feature', metadata: { draft: true } }
])
await whatIf.find({ where: { draft: true } })
await whatIf.release()
await base.release()
// Commit it for real — one atomic generation, with audit metadata
await brain.transact(
[{ op: 'add', type: NounType.Document, subtype: 'note', data: 'New feature', metadata: { draft: true } }],
{ meta: { author: 'dev@example.com', message: 'Add new feature' } }
)VFS File Management
// Write files
await brain.vfs.writeFile('/docs/README.md', 'Project documentation')
await brain.vfs.mkdir('/src/components', { recursive: true })
// Read files
const content = await brain.vfs.readFile('/docs/README.md')
// Semantic search
const reactFiles = await brain.vfs.search('React components with hooks', {
path: '/src'
})
// Get tree structure (safe, prevents infinite recursion)
const tree = await brain.vfs.getTreeStructure('/projects', {
maxDepth: 3
})Type System Reference
Stage 3 CANONICAL taxonomy with 169 types (42 nouns + 127 verbs)
Noun Types (42)
Brainy uses a comprehensive noun type system covering 96-97% of human knowledge:
Core Entity Types (7)
NounType.Person- Individual human entitiesNounType.Organization- Companies, institutions, collectivesNounType.Location- Geographic and spatial entitiesNounType.Thing- Physical objects and artifactsNounType.Concept- Abstract ideas and principlesNounType.Event- Temporal occurrencesNounType.Agent- AI agents, bots, automated systems
Digital/Content Types (4)
NounType.Document- Text-based files and written contentNounType.Media- Audio, video, imagesNounType.File- Generic digital filesNounType.Message- Communication content
Business Types (4)
NounType.Product- Commercial productsNounType.Service- Service offeringsNounType.Task- Actions, todos, work itemsNounType.Project- Organized initiatives
Scientific Types (2)
NounType.Hypothesis- Theories and propositionsNounType.Experiment- Studies and investigations
And 25 more types including: Organism, Substance, Quality, TimeInterval, Function, Proposition, Collection, Dataset, Process, State, Role, Language, Currency, Measurement, Contract, Regulation, Interface, Resource, Custom, SocialGroup, Institution, Norm, InformationContent, InformationBearer, Relationship
Verb Types (127)
Brainy supports 127 relationship types organized into categories:
Foundational (7)
VerbType.InstanceOf,VerbType.SubclassOf,VerbType.ParticipatesInVerbType.RelatedTo,VerbType.Contains,VerbType.PartOf,VerbType.References
Spatial & Temporal (14)
- Location:
LocatedAt,AdjacentTo,ContainsSpatially,OverlapsSpatially,Above,Below,Inside,Outside,Facing - Time:
Precedes,During,OccursAt,Overlaps,ImmediatelyAfter,SimultaneousWith
Causal & Dependency (11)
- Direct:
Causes,Enables,Prevents,DependsOn,Requires - Modal:
CanCause,MustCause,WouldCauseIf,ProbablyCauses - Variations:
RigidlyDependsOn,FunctionallyDependsOn,HistoricallyDependsOn
Creation & Change (10)
- Lifecycle:
Creates,Transforms,Becomes,Modifies,Consumes,Destroys - Properties:
GainsProperty,LosesProperty,RemainsSame,PersistsThrough
Social & Communication (8)
MemberOf,WorksWith,FriendOf,Follows,Likes,ReportsTo,Mentors,Communicates
Epistemic & Modal (14)
- Knowledge:
Knows,Doubts,Believes,Learns - Mental states:
Desires,Intends,Fears,Loves,Hates,Hopes,Perceives - Modality:
CouldBe,MustBe,Counterfactual
Measurement & Comparison (9)
Measures,MeasuredIn,ConvertsTo,HasMagnitude,GreaterThanSimilarityDegree,ApproximatelyEquals,MoreXThan,HasDegree
And 54 more specialized verbs including ownership, composition, uncertainty, deontic relationships (obligations/permissions), context-dependent truth, spatial/temporal variations, information theory, and meta-level relationships.
Complete Reference
For the full taxonomy with all 169 types and their descriptions, see:
- Stage 3 CANONICAL Taxonomy - Complete list with categories
- Noun-Verb Taxonomy Architecture - Design rationale
Migration from pre-Stage-3 taxonomies
Breaking Changes:
NounType.Contentremoved → UseDocument,Message, orInformationContentNounType.Userremoved → UsePersonorAgentNounType.Topicremoved → UseConceptorCategory
New Types Added:
- +11 noun types: Agent, Organism, Substance, Quality, TimeInterval, Function, Proposition, Custom, SocialGroup, Institution, Norm, InformationContent, InformationBearer, Relationship
- +87 verb types: Extensive additions across all categories
Key Features
- ✅ Database as a Value -
brain.now()pins the whole store as an immutableDbin O(1) - ✅ Atomic Transactions -
brain.transact()commits multi-write batches all-or-nothing - ✅ Two-Level CAS - per-entity
ifRevand whole-storeifAtGeneration - ✅ Time Travel -
brain.asOf()serves the full query surface at any reachable past generation - ✅ Instant Snapshots -
db.persist()cuts hard-link snapshots;Brainy.load()opens them read-only - ✅ Speculative Writes -
db.with()answers what-if questions purely in memory - ✅ Reified Transaction Metadata - audit fields recorded durably, readable via
transactionLog() - ✅ VFS Entity Filtering - All VFS entities have the
isVFSEntity: trueflag - ✅ VFS Auto-Initialization - No separate
vfs.init()calls - ✅ VFS Property Access - Use
brain.vfs.method()instead ofbrain.vfs().method() - ✅ Universal Storage Support - Filesystem and memory adapters share one on-disk contract
Support & Resources
- 📖 Documentation: Full Documentation
- 🐛 Issues: GitHub Issues
- 💬 Discussions: GitHub Discussions
- 📦 NPM: @soulcraft/brainy
- ⭐ GitHub: Star us
See Also
- Data Model - Entity structure, data vs metadata, storage fields
- Query Operators - All BFO operators with examples and indexed vs in-memory matrix
- Triple Intelligence Architecture - How vector + graph + document work together
- Find System - Natural language find() details
- VFS Quick Start - Complete VFS documentation
- Import Anything Guide - CSV, Excel, PDF, URL imports
- Consistency Model - The guarantees behind the Db API
- Snapshots & Time Travel - Backup, restore, what-if, audit recipes
License: MIT © Brainy Contributors
Brainy - The Knowledge Operating System From prototype to planet-scale • Zero configuration • Triple Intelligence™ • Database as a Value