TY - GEN
T1 - A machine learning approach to detect early signs of startup success
AU - Thirupathi, Abhinav Nadh
AU - Alhanai, Tuka
AU - Ghassemi, Mohammad M.
N1 - Publisher Copyright:
© 2021 ACM.
PY - 2021/11/3
Y1 - 2021/11/3
N2 - In this study, we investigate a heterogeneous set of startup ventures (different ages, products, teams, levels of maturity, etc.) to identify the time-independent factors associated with their future success. More specifically, we investigated 3,160 unique companies, all of which were recipients of Small Business Innovation Research (SBIR) or Small Business Technology Transfer (STTR) awards. For each company, we collected any publicly available information: the SBIR/STTR award (amount, agency, principal investigator, etc.), and Crunchbase business profile. The collected data were used to train a XGBoost model that predicts whether a company had an initial public offering (IPO), and/or merged with, and/or was acquired by another entity (M&A). The performance of the model assessed using leave one-out-cross validation (LOOCV) was strong: 84% accuracy and 0.91 AUC. We found that employees with entrepreneurial experience, arts, and/or STEM educational backgrounds, among other characteristics played a significant role in predicting the success of small businesses. Our results indicate that machine learning models may be used to assess the viability of small ventures.
AB - In this study, we investigate a heterogeneous set of startup ventures (different ages, products, teams, levels of maturity, etc.) to identify the time-independent factors associated with their future success. More specifically, we investigated 3,160 unique companies, all of which were recipients of Small Business Innovation Research (SBIR) or Small Business Technology Transfer (STTR) awards. For each company, we collected any publicly available information: the SBIR/STTR award (amount, agency, principal investigator, etc.), and Crunchbase business profile. The collected data were used to train a XGBoost model that predicts whether a company had an initial public offering (IPO), and/or merged with, and/or was acquired by another entity (M&A). The performance of the model assessed using leave one-out-cross validation (LOOCV) was strong: 84% accuracy and 0.91 AUC. We found that employees with entrepreneurial experience, arts, and/or STEM educational backgrounds, among other characteristics played a significant role in predicting the success of small businesses. Our results indicate that machine learning models may be used to assess the viability of small ventures.
KW - Business Success
KW - Data Mining
KW - Ensemble methods
KW - Factors Extraction
KW - Startups
KW - Venture Capital
UR - http://www.scopus.com/inward/record.url?scp=85130523067&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85130523067&partnerID=8YFLogxK
U2 - 10.1145/3490354.3494374
DO - 10.1145/3490354.3494374
M3 - Conference contribution
AN - SCOPUS:85130523067
T3 - ICAIF 2021 - 2nd ACM International Conference on AI in Finance
BT - ICAIF 2021 - 2nd ACM International Conference on AI in Finance
PB - Association for Computing Machinery, Inc
T2 - 2nd ACM International Conference on AI in Finance, ICAIF 2021
Y2 - 3 November 2021 through 5 November 2021
ER -