Update README.md
README.md
CHANGED
@@ -32,8 +32,10 @@ I work with "<b>pdfplumber/pdfminer</b>" none OCR, so its very fast!<br>
 <li>Intelligent multiprocessing</li>
 <li>Error tolerant, that means if your PDF is not convertible, it will be skipped, no special handling</li>
 <li>Instant view of the result, hit one pdf on top of the list</li>
-<li>Converts some common tables as json inside the txt file</li>
-<li>
+<li>Converts some common tables as json-foramt inside the txt file, readable for embedder</li>
+<li>Adds the absolute PAGE number to each page</li>
+<li>Adds the label “Chapter” for large font and/or “important” for bold font</li>
+<li>tested on 300 PDF files ~30000 pages</li>
 <li>All txt files will be created in original folder of PDF</li>
 <li>All previous txt files are overwritten</li>
 <li>aprox 5 to 20 Pages/sec - depends on complexity</li>
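The three features added in this change (tables as JSON, an absolute page number per page, and "Chapter"/"important" labels) could be sketched roughly as below. This is not the tool's actual code, which the diff does not show. The function names, the `PAGE n` and `[label]` output formats, and the `body_size`/`chapter_ratio` thresholds are all assumptions made for illustration. The inputs are plain Python values shaped like what pdfplumber typically provides: a table as a list of rows of cell strings (or `None`), plus a font size and font name per line.

```python
import json


def table_to_json(rows):
    """Turn a table (list of rows, first row = header) into a JSON string of records."""
    if not rows:
        return "[]"
    # Empty cells may arrive as None; normalise them to empty strings.
    header = [(cell or "").strip() for cell in rows[0]]
    records = [
        {key: (cell or "").strip() for key, cell in zip(header, row)}
        for row in rows[1:]
    ]
    return json.dumps(records, ensure_ascii=False)


def label_line(text, font_size, font_name, body_size=10.0, chapter_ratio=1.3):
    """Prefix a line with "Chapter" for large fonts and/or "important" for bold fonts.

    body_size and chapter_ratio are hypothetical thresholds, not values from the tool.
    """
    labels = []
    if font_size >= body_size * chapter_ratio:
        labels.append("Chapter")
    if "bold" in font_name.lower():
        labels.append("important")
    prefix = "".join(f"[{label}] " for label in labels)
    return prefix + text


def format_page(page_number, lines, tables):
    """Assemble one page of txt output, starting with its absolute page number."""
    parts = [f"PAGE {page_number}"]  # assumed marker format
    parts.extend(lines)
    parts.extend(table_to_json(table) for table in tables)
    return "\n".join(parts)
```

For example, `format_page(3, [label_line("Intro", 16.0, "Helvetica-Bold")], [[["Name", "Qty"], ["Bolt", "4"]]])` yields a page block that starts with `PAGE 3`, followed by the labelled line and one JSON line for the table. Keeping each table on a single JSON line is one way to keep it intact when the txt file is later split into chunks for an embedder.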