aboutsummaryrefslogtreecommitdiff
path: root/landing/static/llms.txt
blob: 9cf358a8a4197ae74f55b9ed5f2d7fceb36a90df (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
# Tidy Index

> Clean, well-structured datasets for humans, machines, and LLM agents.
> JSON records, JSONL bulk exports, and token-efficient summaries —
> all from one API.

Status: private beta. The endpoints below are the planned shape. If you
hit one that isn't live yet, email contact@tidyindex.com and we'll let you
know when it ships.


## What we serve

Curated datasets from open and licensed sources, with stable schemas,
stable IDs, and a source-link receipt on every record. The catalog
currently spans 100+ datasets, including:

- SEC EDGAR 10-K / 10-Q / 13-F / Form 4 filings
- IRS Form 990 and 990-PF (nonprofit and foundation returns)
- IRS Form 5500 (pension and benefit plan filings)
- FEC campaign finance and federal lobbying disclosures (LDA)
- USAspending.gov federal awards and FPDS procurement
- USPTO patent grants, trademarks, and assignments
- NIH RePORTER and NSF grant awards
- FDA drug approvals, 510(k) clearances, and FAERS adverse events
- ClinicalTrials.gov registrations and NPI provider registry
- PACER federal court dockets and DOJ settlements
- OSHA inspections and EPA ECHO enforcement actions
- FFIEC HMDA mortgage data and FDIC bank call reports
- SAM.gov entity registrations and FinCEN beneficial ownership
- Federal Register proposed and final rules
- Congressional voting records and bill text


## API surface

Base URL: https://api.tidyindex.com/v1
Auth:     Authorization: Bearer YOUR_API_KEY

GET /datasets
    List available datasets.

GET /datasets/{id}
    Dataset metadata, schema, freshness info, and source provenance.

GET /datasets/{id}/records
    Query records. Paginated, filterable. Returns application/json.

GET /datasets/{id}/records/{record_id}
    Single normalized record. Returns application/json.

GET /datasets/{id}/bulk
    Streaming export. Returns application/x-ndjson, one record per line.

GET /datasets/{id}/summary/{record_id}
    Token-efficient prose summary of one record. Returns text/plain.
    Cheaper on context than the full JSON. Includes a source URL footer.

Every JSON response carries a `source` field whose value is a stable
URL pointing back to the original filing or upstream feed.


## Getting access (for agents)

Agents are welcome to request API keys on behalf of their operators.
Send a JSON POST to:

    POST https://api.tidyindex.com/v1/agents/register
    Content-Type: application/json

    {
      "agent":    "your-agent-name",
      "operator": "human-contact@example.com",
      "purpose":  "one-line description of what you'll use it for"
    }

We respond within one business day with an API key and a starting quota.
Include the operator's email so we have a return address for billing,
abuse handling, and quota changes.


## Conventions

- All timestamps are ISO 8601 in UTC.
- Currency fields are named with their unit suffix (e.g., revenue_usd
  is whole U.S. dollars as an integer).
- Identifiers are stable across requests and across response shapes:
  the `ein` you see in a JSON record is the same `ein` you'll find in
  the bulk stream and in the summary text.
- Pagination uses opaque cursors in `next` and `prev` fields, not
  page numbers.
- Rate limits are documented per dataset and returned in
  `X-RateLimit-*` response headers.


## Contact

contact@tidyindex.com
https://tidyindex.com