Skip to content

Commit 8f17721

Browse files
committed
Donut
Donut
1 parent 5d1951c commit 8f17721

9 files changed

Lines changed: 130559 additions & 0 deletions

deep-learning/Transformer-Tutorials/Donut/CORD/Fine_tune_Donut_on_a_custom_dataset_(CORD)_with_PyTorch_Lightning.ipynb

Lines changed: 70676 additions & 0 deletions
Large diffs are not rendered by default.

deep-learning/Transformer-Tutorials/Donut/CORD/Quick_inference_with_DONUT_for_Document_Parsing.ipynb

Lines changed: 5903 additions & 0 deletions
Large diffs are not rendered by default.

deep-learning/Transformer-Tutorials/Donut/DocVQA/Creating_a_toy_DocVQA_dataset_for_Donut.ipynb

Lines changed: 20557 additions & 0 deletions
Large diffs are not rendered by default.

deep-learning/Transformer-Tutorials/Donut/DocVQA/Fine_tune_Donut_on_DocVQA.ipynb

Lines changed: 11551 additions & 0 deletions
Large diffs are not rendered by default.

deep-learning/Transformer-Tutorials/Donut/DocVQA/Quick_inference_with_DONUT_for_DocVQA.ipynb

Lines changed: 323 additions & 0 deletions
Large diffs are not rendered by default.
Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,21 @@
1+
# Donut 🍩 notebooks
2+
In this directory, you can find several notebooks that illustrate how to use Donut both for fine-tuning on custom data as well as inference. I've split up the notebooks according to the different downstream datasets:
3+
4+
- CORD (form understanding)
5+
- DocVQA (visual question answering on documents)
6+
- RVL-DIP (document image classification)
7+
8+
I've implemented Donut as an instance of [`VisionEncoderDecoderModel`](https://huggingface.co/docs/transformers/main/model_doc/vision-encoder-decoder) in the Transformers library.
9+
10+
The full documentation can be found [here](https://huggingface.co/transformers/main/model_doc/donut.html).
11+
12+
The models on the hub can be found [here](https://huggingface.co/models?search=donut).
13+
14+
Note that there's also several Gradio demos available for Donut, hosted as HuggingFace Spaces:
15+
- [DocVQA](https://huggingface.co/spaces/nielsr/donut-docvqa)
16+
- [RVLCDIP](https://huggingface.co/spaces/nielsr/donut-rvlcdip)
17+
- [CORD](https://huggingface.co/spaces/nielsr/donut-cord)
18+
19+
## Third-party resources
20+
21+
Also check out this [great blog](https://www.philschmid.de/fine-tuning-donut) by Philipp Schmid on fine-tuning Donut on the SROIE dataset :)

0 commit comments

Comments
 (0)