Add pdf support for QA parser (#1155)

### What problem does this PR solve?

Support extracting questions and answers from PDF files.
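
As a rough illustration of the idea only, and not the code in this PR, the sketch below groups lines of already-extracted text into question/answer pairs. It assumes each question starts with a numeric bullet such as `1.`, `2)` or `Q3:`; the `QUESTION_BULLET` pattern and the `split_qa` helper are hypothetical names, and the actual parser may detect bullets differently.

```python
import re

# Hypothetical bullet pattern: "1.", "2)", "Q3:" at the start of a line.
# The patterns used by the real parser may differ.
QUESTION_BULLET = re.compile(r"^\s*(?:Q\s*)?(\d+)\s*[.):]\s*(.*)$")


def split_qa(lines):
    """Group text lines into (question, answer) pairs.

    A line matching QUESTION_BULLET opens a new question; every following
    line up to the next bullet is collected as that question's answer.
    Lines before the first bullet are ignored.
    """
    pairs = []
    question, answer = None, []
    for line in lines:
        match = QUESTION_BULLET.match(line)
        if match:
            # Close the previous pair before starting a new question.
            if question is not None:
                pairs.append((question, "\n".join(answer).strip()))
            question, answer = match.group(2).strip(), []
        elif question is not None:
            answer.append(line.strip())
    if question is not None:
        pairs.append((question, "\n".join(answer).strip()))
    return pairs
```

This naive pattern would also treat an answer line that begins with a number and a dot as a new question, so a real parser needs stricter bullet detection.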

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
Zhedong Cen
2024-06-14 15:12:39 +08:00
committed by GitHub
parent 7dc39cbfa6
commit 90975460af
5 changed files with 194 additions and 10 deletions

@@ -141,3 +141,6 @@ readability-lxml==0.8.1
 html_text==0.6.2
 selenium==4.21.0
 webdriver-manager==4.0.1
+cn2an==0.5.22
+roman-numbers==1.0.2
+word2number==1.1