Need a Python dev to build a robust PDF-to-Excel parser. Must handle PDFs with variable layouts and inconsistent formatting.
• Read full PDFs, not just structured tables
• Extract rows of part-like data (IDs, names, prices)
• Detect categories (e.g. “new” vs “removed”)
• Normalize values (e.g. NP, Quote, $ formats)
• Export to Excel with a defined column structure
Let me know and we can discuss more details