
The CIO’s Guide to AI – Leveraging Scarce Skills

Premise

Machine learning (ML) and AI have seized the collective imagination of enterprises everywhere as the most transformational technology in generations. These technologies are the focus of intensive experimentation, following similar experimentation with on-prem Hadoop-based big data projects. But if mainstream enterprises are going to leverage AI and ML widely over the next several years, they are going to have to pick their way carefully through an obstacle course. There just aren’t enough skilled data scientists to work at all levels of the ML and AI food chain.

 

On-premises, Hadoop-based big data investments were going to cure cancer, taste like chocolate, and cost a dollar. But a majority of Hadoop projects at mainstream enterprises struggled to make it past the pilot phase. Why? Because of a lack of two classes of skills. First, Hadoop comprises many complex, open source projects that aren't built to work together seamlessly, and mainstream enterprises didn't have the skills in-house to manage the tool composites that characterized typical Hadoop initiatives. Second, most enterprises lack a pool of qualified data scientists, and the tools intended to facilitate the construction and operationalization of complex big data models offered limited capabilities.

The same issues are emerging in the AI and ML world. Addressing this skills shortage requires that CIOs and LOB executives review their ML and AI projects from the perspective of leverage. For now, mainstream enterprises have to limit the type of work they undertake:

  • Leave new algorithm development to tech companies and the research community. At the most sophisticated level, scientists and researchers create and train new algorithms using general-purpose, but low-level, frameworks or engines (see the “deep data science” level in figure 1).
  • Build applications based on mature AI/ML patterns. Enterprises with sophisticated talent pools of data scientists or statisticians can build applications that are based on relatively well-known design patterns such as fraud prevention or recommenders (see the “developer ready” level in figure 1).

 

Figure 1: Machine learning building blocks, shown on the left, represent progressively higher levels of abstraction from bottom to top. At the lowest level, AI engines are suitable for a relatively small number of AI/ML developers, typically with advanced degrees, who are building new algorithms. AI platforms, at the next level, typically share common algorithms, with differentiation usually coming from tooling. AI/ML APIs package pre-trained models that are ready for mainstream developers. At the highest level, semi-custom applications integrate one or more pre-trained AI/ML models into a business process in order to inform or automate decisions.
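To make these levels concrete, the contrast below is a minimal Python sketch. The pre-trained sentiment endpoint, its URL, and its response format are hypothetical placeholders standing in for any vendor's AI/ML API; the lower-level example uses scikit-learn as one stand-in for a general-purpose engine at the "deep data science" level.

```python
# Minimal sketch contrasting two abstraction levels from figure 1.
# The REST endpoint below is a hypothetical placeholder, not a real vendor API.
import requests
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression

# "Developer ready" level: call a pre-trained AI/ML API.
# A mainstream developer supplies input data and consumes predictions.
resp = requests.post(
    "https://api.example-vendor.com/v1/sentiment",       # hypothetical endpoint
    json={"text": "The claim was flagged and resolved quickly."},
)
print(resp.json())   # e.g. {"label": "positive", "score": 0.93}

# "Deep data science" level: assemble and train a model from lower-level parts.
# The team owns feature engineering, algorithm choice, and evaluation.
texts = ["great service", "billing error again", "fast resolution", "still broken"]
labels = [1, 0, 1, 0]                                    # toy labels for illustration
vectorizer = TfidfVectorizer()
features = vectorizer.fit_transform(texts)
model = LogisticRegression().fit(features, labels)
print(model.predict(vectorizer.transform(["quick and helpful"])))
```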

 

Leave New Algorithm Development To Tech Companies And The Research Community.

Tech vendors with a focus on AI/ML and the research community can push the state of the art in two ways. First, they can apply existing technologies to novel, previously intractable problems, such as diagnosing diseases from radiology images. Second, at the extreme end, tech vendors and researchers can create new paradigms for AI/ML. Perhaps the most prominent example is the reinforcement learning that drove the simulations that let AlphaGo Zero teach itself to beat a previous version of itself, which had already beaten the world's best player.

There are only an estimated 10,000 AI/ML researchers at this level. Universities and technology-centric firms do most of the development here because they can attract more of this talent. These firms can pay more and offer immense data sets and compute power for experimentation. Most of all, they can offer an internal community of world-class talent. Salaries at vendors for those with advanced degrees can start at $350,000-$500,000 per year, plus equity. Based on a web crawl and an analysis of jobs, projects, and other activity, Spiderbook estimates that almost 45% of all companies investing in AI/ML come from software, IT services, internet, and research organizations (see figure 2). No one has yet figured out how to democratize access to these technologies.

Figure 2: Ranking of AI investments by industry based on a detailed web crawl using software designed to analyze account activity for B2B marketing. (SOURCE: Spiderbook/Demandbase)

 

Build Applications Based On Mature AI/ML Patterns.

Prototypical AI/ML applications with high and measurable ROI include fraud prevention and personalization engines, such as next best action or recommendations. Microsoft and IBM have extensive template libraries to help customers get started. Templates remove the need for customers to figure out which algorithms to select, and they help prioritize what input data is required. Developer tooling here is improving. Microsoft introduced an Azure Machine Learning Experimentation service and a Model Management service, which collectively augment the design and deployment stages of the ML lifecycle, both of which have been relatively dark arts. These new services leverage Microsoft's historical strength in developer tools and bring it into "data ops," the data science equivalent of devops. IBM has deep and highly strategic industry solutions for IoT, building on its long history of working closely with large customers.
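As a rough illustration of what one of these templates packages up, the sketch below shows the core of a fraud-prevention pattern using scikit-learn. The file name, column names, and parameter choices are assumptions for illustration, not Microsoft's or IBM's actual template.

```python
# Minimal sketch of the fraud-prevention pattern the templates package up.
# File name, column names, and parameters are illustrative assumptions.
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import precision_score, recall_score

# The template's value: it prescribes which inputs the pattern needs
# (amounts, account age, velocity features, ...) and which algorithm
# family tends to work, so the team skips algorithm selection.
df = pd.read_csv("transactions.csv")                    # assumed labeled history
X = df[["amount", "account_age_days", "txns_last_hour", "is_foreign"]]
y = df["is_fraud"]

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)
model = RandomForestClassifier(class_weight="balanced", random_state=42)
model.fit(X_train, y_train)

preds = model.predict(X_test)
print("precision:", precision_score(y_test, preds))
print("recall:   ", recall_score(y_test, preds))
```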

While templates exist for these types of models, they require varying levels of customization. Because no two enterprises have identical input data sets, each deployment still requires additional feature selection and engineering, labeled training data, and model evaluation. While Microsoft and many others are building out the tooling, support for managing the design-time and run-time data science pipelines is still immature. Most enterprises will need to draw on thin pools of data science expertise. Data scientists have to be in the loop for both design-time and run-time pipelines. At design time, they have to prep and label data and evaluate model fitness. At run time, they have to continually evaluate live model performance and retrain the models using data feedback loops. In IBM's case, the industry solutions are templates that require joint development in order to bring together customer domain expertise and IBM's knowledge of IoT and ML technology.
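The run-time half of that loop can be sketched as follows. This is a simplified illustration assuming scikit-learn-style models, a batch interface for delayed ground-truth labels, and an agreed quality floor; it is not a description of any vendor's pipeline service.

```python
# Minimal sketch of a run-time feedback loop: score live traffic, watch
# model quality, and retrain on accumulated labeled feedback when it drifts.
# The threshold and batch interface are illustrative assumptions.
import numpy as np
from sklearn.base import clone
from sklearn.metrics import roc_auc_score

AUC_FLOOR = 0.80   # assumed quality floor agreed with the business

def monitor_and_retrain(model, live_batches):
    """live_batches yields (features, delayed_ground_truth_labels) pairs."""
    history_X, history_y = [], []
    for X_batch, y_batch in live_batches:
        scores = model.predict_proba(X_batch)[:, 1]       # score live traffic
        history_X.append(X_batch)
        history_y.append(y_batch)
        if roc_auc_score(y_batch, scores) < AUC_FLOOR:    # performance drifted
            # Retrain on the accumulated feedback; in practice a data scientist
            # re-validates features and model fitness before redeploying.
            model = clone(model).fit(np.vstack(history_X), np.concatenate(history_y))
    return model
```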

Wikibon believes the skills base for ML talent outside the tech community is somewhat thin. Based on the same Spiderbook web crawl that identified AI/ML vertical activity in figure 2, the number of companies and the maturity of their investments are both quite limited (see figure 3). Another way of assessing the talent pool is to look at the number of developers trained on Python and R. Based on that count, there are an estimated 1.5-3M data scientists and data engineers, compared with 8M developers trained on the previous mainstream analytic software, SAS. Wikibon believes these numbers overstate how many engineers can manage the entire ML model design process; our research indicates a large majority of this population does data wrangling and preparation. O'Reilly's 2016 salary survey indicates the median base salary across these skills is $106K in the U.S.

 

Figure 3: A maturity model of AI investment indicates that the number of companies with strategic commitments to AI is well below 100. This number is consistent with figure 2, which shows that most activity is taking place in the tech vertical. The large number of lab projects is consistent with the widespread experimentation with on-prem big data projects over the last five years. (SOURCE: Spiderbook/Demandbase)

 

Action Item

To keep their AI/ML pilots from bogging down – or failing outright unnecessarily – CIOs and line of business (LOB) executives need to inventory their in-house skills and establish concrete strategies for acquiring those skills or transferring AI/ML knowledge into their organizations in some other, non-labor form.

 
