Company Alignment Automation
Unified system to normalize, organize, and classify company data across sources
This project focused on aligning company data that was previously scattered across four separate spreadsheets, each using different naming conventions and levels of quality. The automation ingests multiple datasets and standardizes company names by removing legal suffixes (such as Ltd, LLC, SA), correcting formatting inconsistencies, and identifying duplicate or overlapping entries. Companies are then matched and unified into a single structured dataset, ensuring that each organization is represented once and consistently. On top of normalization, the system classifies companies by attributes such as quality, level of interest, and relevance, allowing users to filter, search, and segment companies efficiently through a software interface. The result is a reliable internal system that replaces manual spreadsheet cleanup, improves data accuracy, and gives teams a clear, consistent view of company information for downstream workflows and decision-making.
Key Features
- Ingestion of company data from multiple spreadsheet sources
- Normalization of company names and removal of legal suffixes
- Duplicate detection and entity matching across datasets
- Centralized company directory with clean, unified records
- Classification by quality, interest, and relevance
- Fast search and filtering through a software interface