kalle07 committed on
Commit 3bc61ac · verified · 1 Parent(s): 2778ed1

Update README.md

Files changed (1)
  1. README.md +4 -2
README.md CHANGED
@@ -32,8 +32,10 @@ I work with "<b>pdfplumber/pdfminer</b>" none OCR, so its very fast!<br>
  <li>Intelligent multiprocessing</li>
  <li>Error tolerant, that means if your PDF is not convertible, it will be skipped, no special handling</li>
  <li>Instant view of the result, hit one pdf on top of the list</li>
- <li>Converts some common tables as json inside the txt file</li>
- <li>It adds the absolute PAGE number to each page</li>
+ <li>Converts some common tables as json-foramt inside the txt file, readable for embedder</li>
+ <li>Adds the absolute PAGE number to each page</li>
+ <li>Adds the label “Chapter” for large font and/or “important” for bold font</li>
+ <li>tested on 300 PDF files ~30000 pages</li>
  <li>All txt files will be created in original folder of PDF</li>
  <li>All previous txt files are overwritten</li>
  <li>aprox 5 to 20 Pages/sec - depends on complexity</li>
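
The two features added in this commit (a “Chapter” label for large font, an “important” label for bold font) together with the existing per-page PAGE number could be sketched roughly as below. This is not the project's actual code. It is a minimal illustration that assumes each text line arrives as a list of character dicts with `text`, `size` and `fontname` keys, which is the shape pdfplumber reports for characters. The function names, the `size_factor` threshold, the "bold in the font name" test and the exact output format (`PAGE n`, `Chapter: `, `important: `) are all assumptions made for the example.

```python
from statistics import median


def label_lines(lines, size_factor=1.3):
    """Return one text string per line, prefixed with a label where it applies.

    `lines` is a list of lines; each line is a list of char dicts with the
    keys "text", "size" and "fontname" (hypothetical input shape, modelled on
    pdfplumber's character objects).
    """
    # Use the median size of all visible characters as the "body text" size.
    all_sizes = [c["size"] for line in lines for c in line if c["text"].strip()]
    if not all_sizes:
        return []
    body_size = median(all_sizes)

    labelled = []
    for line in lines:
        visible = [c for c in line if c["text"].strip()]
        if not visible:
            continue  # skip empty or whitespace-only lines
        text = "".join(c["text"] for c in line).strip()
        line_size = median(c["size"] for c in visible)
        # Assumption: bold fonts carry "bold" somewhere in their font name.
        is_bold = all("bold" in c["fontname"].lower() for c in visible)

        if line_size >= size_factor * body_size:
            text = f"Chapter: {text}"
        elif is_bold:
            text = f"important: {text}"
        labelled.append(text)
    return labelled


def render_page(page_number, lines):
    """Prefix the labelled lines with the absolute page number."""
    return "\n".join([f"PAGE {page_number}", *label_lines(lines)])


def fake_chars(text, size, fontname):
    """Build synthetic char dicts so the sketch runs without a PDF."""
    return [{"text": ch, "size": size, "fontname": fontname} for ch in text]


if __name__ == "__main__":
    demo = [
        fake_chars("Introduction", 18, "Helvetica"),
        fake_chars("Plain body text here.", 10, "Helvetica"),
        fake_chars("Warning", 10, "Helvetica-Bold"),
        fake_chars("More body text follows.", 10, "Helvetica"),
    ]
    print(render_page(3, demo))
```

Comparing each line against the median character size keeps the threshold relative to the document, so a PDF set entirely in 14 pt text would not have every line tagged as a chapter heading.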