Add pdf support for QA parser (#1155)

### What problem does this PR solve?

Support extracting questions and answers from PDF files
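
As an illustration of the idea only, here is a minimal sketch of pairing numbered questions with their answers once text lines have been extracted from a PDF. It is not the code in this PR. The function name `pair_questions_and_answers`, the regex, and the assumption that every question opens with an Arabic-numeral bullet such as `1.` or `2)` are all hypothetical. Other numbering styles and the PDF text extraction itself are out of scope.

```python
import re

# Hypothetical pattern: a line that opens with an Arabic numeral followed by
# "." or ")" and whitespace is treated as the start of a new question.
QUESTION_BULLET = re.compile(r"^\s*(\d+)[.)]\s+(.*)$")


def pair_questions_and_answers(lines):
    """Group text lines into (question, answer) pairs.

    A numbered line starts a new question; the non-empty unnumbered lines
    that follow are joined into its answer. Lines before the first question
    are ignored.
    """
    pairs = []
    question, answer_lines = None, []
    for line in lines:
        match = QUESTION_BULLET.match(line)
        if match:
            # Close the previous question before starting a new one.
            if question is not None:
                pairs.append((question, " ".join(answer_lines)))
            question, answer_lines = match.group(2).strip(), []
        elif question is not None and line.strip():
            answer_lines.append(line.strip())
    # Flush the last open question, if any.
    if question is not None:
        pairs.append((question, " ".join(answer_lines)))
    return pairs
```

Calling `pair_questions_and_answers(page_lines)` on a list of strings returns a list of `(question, answer)` tuples in document order. A question with no following text gets an empty answer string.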

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
Zhedong Cen
2024-06-14 15:12:39 +08:00
committed by GitHub
parent 7dc39cbfa6
commit 90975460af
5 changed files with 194 additions and 10 deletions

@@ -139,4 +139,7 @@ fasttext==0.9.2
 volcengine==1.0.141
 opencv-python-headless==4.9.0.80
 readability-lxml==0.8.1
-html_text==0.6.2
+html_text==0.6.2
+cn2an==0.5.22
+roman-numbers==1.0.2
+word2number==1.1