LoreGraph product docs

Document Processing

Summary

Document processing turns uploaded source files into extracted text, sections, course outlines, lessons, and quiz drafts.


Who this is for

  • Workspace admins
  • Course creators

Before you start

  • Use a supported file type: PDF, Word (.docx), or PowerPoint (.pptx).
  • Make sure the document is readable and current.
  • Remove duplicate or irrelevant pages where possible.
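
If you want to spot duplicate pages before uploading, the check can be sketched as below. This is a minimal illustration, not part of LoreGraph: it assumes you already have each page's text as a string, and the function name find_duplicate_pages is hypothetical.

```python
import hashlib


def find_duplicate_pages(pages):
    """Return (duplicate_index, first_index) pairs for pages whose text matches.

    Hypothetical helper: `pages` is a list of page texts you extracted yourself.
    Pages are compared after collapsing whitespace and lowercasing.
    """
    seen = {}          # digest -> index of the first page with that text
    duplicates = []
    for index, text in enumerate(pages):
        normalized = " ".join(text.split()).lower()
        digest = hashlib.sha256(normalized.encode("utf-8")).hexdigest()
        if digest in seen:
            duplicates.append((index, seen[digest]))
        else:
            seen[digest] = index
    return duplicates


pages = ["Intro", "Appendix A\n table", "appendix a   table", "End"]
duplicates = find_duplicate_pages(pages)
```

This only catches pages whose text is identical after normalization. Near-duplicates, such as a reprinted appendix with a different page footer, still need a manual look.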

Concepts

LoreGraph processes uploaded documents in stages: upload, text extraction, content cleanup, section detection, outline generation, lesson draft generation, and quiz generation.
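
The stage order above can be written down as an ordered enumeration. This is only a sketch for reasoning about the sequence. The names Stage, next_stage, and remaining_stages are assumptions made for illustration and do not describe LoreGraph's internal code or API.

```python
from enum import IntEnum
from typing import List, Optional


class Stage(IntEnum):
    """Processing stages, numbered in the order this page lists them."""
    UPLOAD = 1
    TEXT_EXTRACTION = 2
    CONTENT_CLEANUP = 3
    SECTION_DETECTION = 4
    OUTLINE_GENERATION = 5
    LESSON_DRAFT_GENERATION = 6
    QUIZ_GENERATION = 7


def next_stage(current: Stage) -> Optional[Stage]:
    """Return the stage that follows `current`, or None after the last stage."""
    if current is Stage.QUIZ_GENERATION:
        return None
    return Stage(current + 1)


def remaining_stages(current: Stage) -> List[Stage]:
    """List the stages that still have to run after `current`."""
    return [stage for stage in Stage if stage > current]
```

Because each stage feeds the next, a problem introduced early (for example, poor text extraction) carries through to the outline, lessons, and quizzes.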

PDF processing can depend on page count, so a large PDF may be handled differently from a small one.

DOCX quality depends heavily on heading styles and clean document structure.

PPTX files may produce flatter outlines because slides do not always have a deep heading hierarchy.
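
To see why heading structure matters, the sketch below nests a list of (level, title) pairs into an outline and measures its depth. It is an illustration under assumptions, not LoreGraph's section-detection logic: the functions build_outline and outline_depth are hypothetical, and slide titles are modeled as all sitting at level 1.

```python
def build_outline(headings):
    """Nest (level, title) pairs into a tree. Level 1 is the top level."""
    root = {"title": None, "children": []}
    stack = [(0, root)]  # path of (level, node) from the root to the current node
    for level, title in headings:
        node = {"title": title, "children": []}
        # Climb back up until the top of the stack is shallower than this heading.
        while stack[-1][0] >= level:
            stack.pop()
        stack[-1][1]["children"].append(node)
        stack.append((level, node))
    return root["children"]


def outline_depth(outline):
    """Return the number of nesting levels in an outline (0 if empty)."""
    if not outline:
        return 0
    return 1 + max(outline_depth(node["children"]) for node in outline)


# A Word document with styled headings at three levels.
docx_headings = [(1, "Onboarding"), (2, "Accounts"), (3, "Passwords"), (1, "Safety")]
# A slide deck where every slide title is treated as a top-level heading.
pptx_headings = [(1, "Slide A"), (1, "Slide B"), (1, "Slide C")]

docx_outline = build_outline(docx_headings)
pptx_outline = build_outline(pptx_headings)
```

With these sample inputs the Word outline has three levels and two top-level sections, while the slide outline is a flat list of three entries.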

Content generated from messy documents should be reviewed carefully before it is published.

Steps

  1. Choose the source file.
  2. Confirm or enter a course title.
  3. Continue to generation options.
  4. Upload the file and wait for processing.
  5. Review the generated outline and source sections.
  6. Regenerate or edit sections if the document structure was messy.
  7. Generate lessons and quizzes only after the outline looks right.
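
The ordering in the steps above, where lessons and quizzes come only after the outline is reviewed, can be expressed as a simple gate. This is a hedged sketch for teams that script their own checklists. CourseDraft, outline_approved, and can_generate_lessons are hypothetical names and not part of any LoreGraph API.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class CourseDraft:
    """Hypothetical record of where a course draft stands."""
    title: str
    outline: List[str] = field(default_factory=list)
    outline_approved: bool = False  # set to True by a human reviewer


def can_generate_lessons(draft: CourseDraft) -> bool:
    """Allow lesson and quiz generation only for a titled, reviewed, non-empty outline."""
    has_title = bool(draft.title.strip())
    has_outline = len(draft.outline) > 0
    return has_title and has_outline and draft.outline_approved


draft = CourseDraft(title="Warehouse Safety", outline=["Lifting", "Forklifts"])
ready_before_review = can_generate_lessons(draft)

draft.outline_approved = True
ready_after_review = can_generate_lessons(draft)
```

The point of the gate is that approval is an explicit human decision rather than something inferred from the outline merely existing.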

Settings reference

Setting                 What it does                                       Recommended default
Supported file types    Controls whether upload can proceed.               PDF, .docx, and .pptx.
OCR or scanned content  Affects extraction quality for image-heavy pages.  Use readable text PDFs when possible.
Outline review          Prevents bad structure from flowing into lessons.  Always review before generation.

Example

A slide deck with one process per slide can become a clear training outline. A deck with dense speaker notes and no slide titles may need manual outline cleanup.

Common mistakes

  • Uploading a scanned PDF and assuming every line extracted perfectly.
  • Ignoring duplicated appendix pages.
  • Generating lessons before fixing a weak outline.
  • Treating complex tables as final without human review.

Supported file types

  • PDF
  • Word documents (.docx)
  • PowerPoint presentations (.pptx)
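
A quick pre-upload check against the list above might look like the following. It is a minimal sketch that looks only at the file extension. SUPPORTED_EXTENSIONS and is_supported are names chosen for this example, and the check says nothing about how LoreGraph itself validates uploads.

```python
from pathlib import Path

# Extensions taken from the supported file types listed above.
SUPPORTED_EXTENSIONS = {".pdf", ".docx", ".pptx"}


def is_supported(filename: str) -> bool:
    """Return True if the file's final extension is in the supported set."""
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS


candidates = ["Handbook.PDF", "deck.pptx", "notes.txt", "archive.tar.gz"]
accepted = [name for name in candidates if is_supported(name)]
```

An extension check cannot tell whether a PDF is scanned or whether a Word file uses heading styles, so it complements the outline review rather than replacing it.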

Planned formats

  • Transcripts
  • Audio
  • Video
  • HTML pages
  • Google Drive
  • Notion
  • Website URLs

Last updated: May 30, 2026