items definition) and keep the output deterministic and well-typed.
Prefer not to write JSON by hand? The Unsiloed dashboard has a schema builder with Manual and Auto-Suggest modes. In Auto-Suggest, describe the fields you want, upload an example document, and the dashboard generates a schema you can export and pass to
/v2/extract.Core Requirements
1. Root Object Every schema starts with"type": "object". Arrays and primitives aren’t allowed at the top level.
"properties" key. Each field must specify a "type" and should include a clear "description".
"required" array. Field names must exactly match those defined in "properties".
"additionalProperties": false at every object level to ensure only specified fields appear in output.
Supported Types
Extraction schemas support four field types: String: For text, dates, IDs, names, addresses, and any textual dataitems to define the structure of array elements
Building Schemas
Primitive Types
Primitive fields usestring, number, or boolean as their type. These are the building blocks of your schema.
Arrays of Objects
Use arrays when you have repeating data like line items, transactions, or people.Nested Arrays
For hierarchical data, nest an array inside the objects of another array.Example 1: Invoice Extraction
Example 1: Invoice Extraction
A common schema for extracting data from US invoices.
Example 2: Public Company Filing (10-K / Annual Report)
Example 2: Public Company Filing (10-K / Annual Report)
A schema for extracting governance and ownership information from US SEC filings.

