we start with some automated content acquisition, before we use nlp for keyword extraction and tagging. The company-tag description is then located in a huge tech-domain-specific vector space. From vector space similarities and a couple of heuristics, we derive the company classification.
Worked with Orient back in 2012, had some issues regarding performance and switched. Following the news about them, I thought they had made great progress. This benchmark kind of shows the exact opposite.
Really helpful description for things to watch masteriung a shift to graphs. It really is a challenge. The 'triviality gap' was a heads up, especially for veteran IT staff.