Paper to Slide/Presentation converter in One Click
P

Paper to Slide/Presentation converter in One Click

Paper to Slide/Presentation converter in One Click

CLI
3,739 stars
N/A forks
N/A contributors

README

Project documentation from GitHub

Paper2Slides: Turn Research Papers into Presentations in One Click

Let's be honest: turning a dense academic paper into a clear, engaging presentation is a chore. You're flipping between sections, trying to distill complex ideas into bullet points, and hunting for key figures—all while the clock ticks toward your deadline. What if you could skip that grunt work entirely?

Enter Paper2Slides, an open-source tool that automates the heavy lifting. It takes a research paper (PDF) and generates a structured slide deck for you. It's not just a simple text extractor; it intelligently identifies the core components of a paper and maps them into a standard presentation format.

What It Does

Paper2Slides is a Python-based tool that processes a PDF of an academic paper and outputs a PowerPoint presentation (.pptx). It uses a combination of layout analysis and natural language processing to break the paper down. It looks for the standard sections—Abstract, Introduction, Methodology, Results, Conclusion—and pulls out the most salient sentences and figures. Then, it organizes this content into a logical slide flow, complete with titles, bullet points, and embedded images.

The goal isn't to produce a final, polished presentation you'd use without review. Instead, it gives you a robust, coherent first draft in seconds, saving you hours of manual copying, pasting, and formatting.

Why It's Cool

The clever part is in the pipeline. It doesn't just naively split text. First, it parses the PDF to understand its structure, separating text blocks from figures and tables. Then, it classifies which part of the paper each block belongs to. Using NLP techniques, it scores sentences within each section for importance, selecting the ones that best summarize the content for a slide format. Finally, it stitches it all together with the python-pptx library, creating clean, standardized slides.

For developers and researchers, this is a neat example of a practical NLP/ML pipeline applied to a real-world problem. The code is modular, so you could tweak the sentence selection algorithm, adjust the slide template, or even retrain the section classifier for different types of documents. It’s a great starting point for anyone interested in document understanding automation.

How to Try It

The project is on GitHub, and getting it running is straightforward if you have a Python environment.

  1. Clone the repo:
    git clone https://github.com/HKUDS/Paper2Slides.git
    cd Paper2Slides
    
  2. Install the required packages (check the requirements.txt in the repo):
    pip install -r requirements.txt
    
  3. The main script is paper2slides.py. Run it with the path to your target PDF:
    python paper2slides.py --pdf_path /path/to/your/paper.pdf
    

Did you like this issue?

Join our weekly newsletter

Related Projects

Love discovering amazing projects?

Help us continue bringing you the best open-source discoveries every week.

Back to Projects
Last updated: Dec 10, 2025