1. Fine-tuning on Document AI Tasks

This repo contains my implementations of models fine-tuned on document AI tasks.

  1. Fine-tuning LayoutLMv3 on the FUNSD dataset
  2. Fine-tuning LiLT on the FUNSD dataset
  3. Fine-tuning RoBERTa on the FUNSD dataset
  4. Fine-tuning DocFormer on the FUNSD dataset
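
One preprocessing step shared by the layout-aware models in this list (LayoutLMv3 and LiLT, as exposed in Hugging Face Transformers) is scaling word bounding boxes from pixel coordinates to a 0-1000 integer grid. The snippet below is a minimal sketch of that step, not the code used in this repo; the function name `normalize_box` and the sample page size are assumptions made for illustration.

```python
from typing import List, Sequence


def normalize_box(box: Sequence[float], width: float, height: float) -> List[int]:
    """Scale a pixel-space (x0, y0, x1, y1) box to the 0-1000 integer grid.

    `normalize_box` is a hypothetical helper name, not part of any library.
    """
    if width <= 0 or height <= 0:
        raise ValueError("page width and height must be positive")
    x0, y0, x1, y1 = box
    scaled = [
        int(1000 * x0 / width),
        int(1000 * y0 / height),
        int(1000 * x1 / width),
        int(1000 * y1 / height),
    ]
    # Clamp so that slightly out-of-page annotations stay inside [0, 1000].
    return [min(1000, max(0, v)) for v in scaled]


# Assumed example: a word box on a 1000 x 500 pixel page.
example = normalize_box((50, 100, 200, 150), width=1000, height=500)
print(example)
```

The clamp is a defensive choice: annotation boxes that spill a few pixels past the page edge would otherwise produce coordinates outside the range the position embeddings cover.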

Current Working:

WANDB Report:

FUNSD Dataset:

  • The W&B progress report can be found here

2. Paper Implementations:

Natural Language Processing

  • DocFormer: End-to-End Transformer for Document Understanding [Paper] [Code]
  • LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding [Paper] [Code]
  • EIT: Enhanced Interactive Transformer [Paper] [Code]

Visual Question Answering

  • LaTr: Layout-Aware Transformer for Scene-Text VQA [Paper] [Code]

Computer Vision

3. Projects: