Fix off-by-one error in `parse_range_str` page range handling by yijinlee · Pull Request #107 · datalab-to/chandra

yijinlee · 2026-06-10T14:53:15Z

parse_range_str returns 1-based page numbers, but load_pdf_images iterates with a 0-based index. This causes --page-range 1 to return page 2, --page-range 2 to return page 3, etc.

Fix in chandra/input.py, parse_range_str.

Steps to reproduce error:

Take a PDF where page 1 and page 2 have distinct content
Run chandra input.pdf ./output --page-range 1
Output contains page 2's content instead of page 1's

Fix off-by-one error in parse_range_str page range handling

fe70b00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix off-by-one error in `parse_range_str` page range handling#107

Fix off-by-one error in `parse_range_str` page range handling#107
yijinlee wants to merge 1 commit into
datalab-to:masterfrom
yijinlee:patch-1

yijinlee commented Jun 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

yijinlee commented Jun 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant