Pinecone Assistant limits vary based on subscription plan.

Object limits

Object limits are restrictions on the number or size of assistant-related objects. Limits in this table are per project today; some may move to per organization in the future.
| Metric | Starter plan | Standard plan | Enterprise plan |
| --- | --- | --- | --- |
| Assistants per project | 5 | Unlimited | Unlimited |
| File storage per project | 1 GB | Unlimited | Unlimited |
| Chat input tokens per project | 500,000 / month | Unlimited | Unlimited |
| Chat output tokens per project | 300,000 / month | Unlimited | Unlimited |
| Context retrieval tokens per project | 500,000 / month | Unlimited | Unlimited |
| Ingestion units per project | 1,000 / month | Unlimited | Unlimited |
| Evaluation input tokens per project | Not available | 150,000 | 500,000 |
| Files per assistant | 100 | Unlimited | Unlimited |
| File size (.docx, .json, .md, .txt) | 10 MB | 10 MB | 10 MB |
| File size (.pdf) | 10 MB | 100 MB | 100 MB |
| Metadata size per file | 16 KB | 16 KB | 16 KB |
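As an illustration of how the per-file size limits above might be enforced before an upload is attempted, here is a minimal client-side sketch. The function names, the plan keys (`"starter"`, `"standard"`, `"enterprise"`), and the choice of 1 MB = 1,000,000 bytes are assumptions made for this example; they are not part of the Pinecone API, and the page does not say which byte definition the service uses.

```python
from pathlib import Path

# Assumption: 1 MB = 1,000,000 bytes. The limits page does not define the unit,
# so the smaller (more conservative) interpretation is used here.
MB = 1_000_000

# Per-file size limits copied from the object limits table, keyed by plan.
PDF_LIMIT_MB = {"starter": 10, "standard": 100, "enterprise": 100}
OTHER_LIMIT_MB = 10  # .docx, .json, .md, .txt on every plan
OTHER_EXTENSIONS = {".docx", ".json", ".md", ".txt"}


def max_upload_bytes(filename: str, plan: str) -> int:
    """Return the size limit in bytes for this file type on the given plan.

    Raises ValueError for extensions the table does not list and
    KeyError for an unknown plan name.
    """
    ext = Path(filename).suffix.lower()
    if ext == ".pdf":
        return PDF_LIMIT_MB[plan] * MB
    if ext in OTHER_EXTENSIONS:
        return OTHER_LIMIT_MB * MB
    raise ValueError(f"no size limit listed for {ext!r} files")


def fits_limit(filename: str, size_bytes: int, plan: str) -> bool:
    """True if a file of size_bytes is within the limit for its type and plan."""
    return size_bytes <= max_upload_bytes(filename, plan)
```

For example, `fits_limit("report.pdf", 50 * MB, "standard")` passes while the same file on `"starter"` does not. A local check like this only avoids an obviously doomed request; the service remains the authority on what it accepts.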
Multimodal PDFs (currently in public preview) are subject to the additional limits listed below. Multimodal PDF processing uses the same ingestion unit as standard uploads, but it is billed at about twice the standard per-unit rate (see Pricing and limits). The object and rate limits for assistants also apply (see #limits and #rate-limits).
| Metric | Starter plan | Standard plan | Enterprise plan |
| --- | --- | --- | --- |
| Max file size | 10 MB | 50 MB | 50 MB |
| Page limit | 100 | 100 | 100 |
| Multimodal PDFs per assistant | 10 | 20 | 20 |

Rate limits

Rate limits help protect your applications from misuse and maintain the health of our shared infrastructure. These limits are designed to support typical production workloads while ensuring reliable performance for all users. Most rate limits can be adjusted upon request. If you need higher limits to scale your application, contact Support with details about your use case. Requests that exceed a rate limit fail and return a 429 - TOO_MANY_REQUESTS status.
To handle rate limits, implement retry logic with exponential backoff.
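The retry advice above can be sketched as follows. This is a generic pattern, not Pinecone's SDK: `RateLimited` is a hypothetical exception that your own request wrapper would raise when it sees a 429 response, and `call_with_backoff`, its parameters, and the default delays are illustrative choices. Random jitter is added on top of the exponential delay, a common refinement the page itself does not mention.

```python
import random
import time


class RateLimited(Exception):
    """Hypothetical exception: raise it when a response has status 429."""


def call_with_backoff(request, max_retries=5, base_delay=1.0, max_delay=30.0,
                      sleep=time.sleep):
    """Call request(); on RateLimited, wait and retry with exponential backoff.

    request is any zero-argument callable. sleep is injectable so the
    function can be exercised without real waiting.
    """
    for attempt in range(max_retries + 1):
        try:
            return request()
        except RateLimited:
            if attempt == max_retries:
                raise  # out of retries: surface the error to the caller
            # The delay ceiling doubles each attempt (base, 2*base, 4*base, ...),
            # capped at max_delay.
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Jitter: sleep a random fraction of the ceiling so that many
            # clients hitting the limit together do not retry in lockstep.
            sleep(random.uniform(0, delay))
```

With the defaults, a call is attempted at most six times, and the wait before each retry is drawn from a range whose upper bound grows from 1 second to 16 seconds. Errors other than `RateLimited` propagate immediately, since retrying them is unlikely to help.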
| Metric | Starter plan | Standard plan | Enterprise plan |
| --- | --- | --- | --- |
| Assistant list/get requests per minute | 40 | 100 | 500 |
| Assistant create/update requests per minute | 20 | 50 | 100 |
| Assistant delete requests per minute | 20 | 50 | 100 |
| File get requests per minute | 100 | 300 | 6,000 |
| File list requests per minute | 50 | 150 | 3,000 |
| File upload requests per minute | 5 | 20 | 300 |
| Multimodal PDF upload requests per minute | 5 | 20 | 300 |
| File delete requests per minute | 5 | 20 | 300 |
| Chat input tokens per minute | 100,000 | 300,000 | 1,000,000 |
| Chat history tokens per query | 64,000 | 64,000 | 64,000 |
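Besides retrying after a 429, a client can try to stay under a per-minute request limit in the first place. The sketch below is one way to do that with a sliding 60-second window. `SlidingWindowLimiter` is a hypothetical helper, not part of any Pinecone library, and how the service itself measures a minute is not documented here, so a local limiter like this reduces 429 responses without guaranteeing that none occur.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most max_calls within any trailing window of `period` seconds."""

    def __init__(self, max_calls, period=60.0, clock=time.monotonic):
        self.max_calls = max_calls
        self.period = period
        self.clock = clock      # injectable so the limiter can be tested
        self._calls = deque()   # timestamps of accepted calls, oldest first

    def try_acquire(self) -> bool:
        """Record a call and return True if it fits in the window, else False."""
        now = self.clock()
        # Forget calls that have aged out of the window.
        while self._calls and now - self._calls[0] >= self.period:
            self._calls.popleft()
        if len(self._calls) < self.max_calls:
            self._calls.append(now)
            return True
        return False


# Example: the Starter plan row lists 5 file upload requests per minute.
uploads = SlidingWindowLimiter(max_calls=5)
```

A caller would check `uploads.try_acquire()` before each upload and wait briefly when it returns `False`. The limiter counts requests only; the token-based limits in the table would need a variant that tracks token totals instead.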