Parsing creates machine-ready text from PDFs, images, or HTML snippets.
When to use parsing
- Feed OCR’d text into LLM prompts.
- Generate embeddings for semantic search.
- Pre-process files before custom NLP pipelines.
Studio workflow
- Upload a document at https://app.algorythmos.fr.
- Select Parsing and choose the output format: plain text or structured Markdown.
- Enable Preserve layout if downstream systems rely on page markers.
API example
curl -X POST https://api.algorythmos.fr/parse/file \
-H "x-api-key: $ALG_KEY" \
-F "file=@/path/to/report.pdf" \
-F "format=markdown"

The response returns a parser_run_id. Poll GET /parse/runs/{parser_run_id} until status becomes succeeded, then download the parsed content from the output_uri.
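The polling step can be sketched in Python as follows. This is a minimal sketch, not an official client. It assumes the run endpoint returns a JSON object with the status and output_uri fields at the top level, and that a run which cannot complete reports a status of "failed"; the source only documents succeeded, so treat the failure value as a guess. The function names, the polling interval, and the timeout are illustrative.

```python
import json
import time
import urllib.request
from typing import Any, Callable, Dict

# Assumption: the API reports unrecoverable runs with this status value.
FAILED_STATES = {"failed"}


def http_fetch_status(run_id: str, api_key: str,
                      base_url: str = "https://api.algorythmos.fr") -> Dict[str, Any]:
    """GET /parse/runs/{run_id} and decode the body (assumed to be JSON)."""
    req = urllib.request.Request(
        f"{base_url}/parse/runs/{run_id}",
        headers={"x-api-key": api_key},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def wait_for_run(fetch_status: Callable[[str], Dict[str, Any]],
                 run_id: str,
                 interval: float = 2.0,
                 timeout: float = 300.0,
                 sleep: Callable[[float], None] = time.sleep) -> str:
    """Poll until the run succeeds, then return its output_uri."""
    deadline = time.monotonic() + timeout
    while True:
        run = fetch_status(run_id)
        status = run.get("status")
        if status == "succeeded":
            return run["output_uri"]
        if status in FAILED_STATES:
            raise RuntimeError(f"parser run {run_id} ended with status {status!r}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"parser run {run_id} did not finish in {timeout}s")
        sleep(interval)
```

To use it against the live API, pass a fetcher bound to your key, for example wait_for_run(lambda rid: http_fetch_status(rid, api_key), parser_run_id). Taking the fetcher as a parameter keeps the loop testable without network access.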