Skip to content
Blueprint Technologies - Data information specialists
Main Menu
  • What we do

      Artificial Intelligence

      Intelligent SOP
      Generative AI
      Video analytics

      Engineering

      Application development
      Cloud & infrastructure
      Lakehouse optimization

      Data & Analytics

      Data platform modernization
      Data governance
      Data management
      Data migration
      Data science & analytics

      Strategy

      TCO planning
      Productization
      Future proofing
  • Industries

      Manufacturing

      Enhance productivity and efficiency through tailored technology solutions, optimizing processes, and drive innovation in manufacturing operations.

      Retail

      Revolutionize customer experiences through innovative technology solutions for seamless shopping journeys and enhanced retail operations.

      Health & Life Sciences

      Advance healthcare outcomes and pharmaceutical innovations through cutting-edge technology solutions and data-driven strategies.

      Financial Services

      Empower financial institutions with secure and scalable technology solutions, driving digital transformation, and personalized customer experiences.

  • Databricks

      Databricks
      Center of Excellence

      Maximize your Databricks experience with our comprehensive Center of Excellence resources and support.

      QuickStarts

      Proof-of-value projects designed to get you started quickly on Databricks.

      Accelerated Data Migration

      Regardless of the source, we specialize in migration your data to Databricks with speed and quality.

      Unity Catalog Migration

      Accelerate your UC migration and minimize errors with our meticulously tested Brickbuilder approved solution.

      Lakehouse Optimizer

      Get higher return on your investment and minimize your total cost of ownership with self-facilitated optimization.

      Accelerated Snowflake to Databricks Migration

      Unlock increased cost savings, heightened operational efficiency, and enhanced analytical capabilities. 

  • Our work
  • Insights
  • About

      Our Approach

      Discover our holistic approach to uncovering strategic opportunities.

      Careers

      Explore exciting career opportunities and join our team today.

      News

      Get the latest updates and insights about our company.

      Events

      Stay updated on upcoming events and webinars.

      Our Partners

      Get to know our trusted technology partners and collaborators.

Connect
Blueprint Technologies - Data information specialists

The basics of data science: Part 2

By Blueprint Team

Have you read The basics of data science: Part 1?  Click here

At Blueprint, one of our key practice areas is data science and analytics. However, we still find that many do not understand the basics of data science, nor do they accurately estimate the benefits to be gained from getting the most from their company’s data. We interviewed our Solutions Architect and Data Science expert, Manish Shukla, to help answer some of the most asked questions.

What are the opportunities companies miss out on if they don’t develop their data science practice?

MS: If you do not invest in and maximize data science, you can never be a data-driven organization. That is a very powerful, but accurate statement. All organizations must make decisions based on the best, most relevant and accurate data, but many misunderstand resource allocation. Data lakes and BI teams don’t make an organization data-driven. Without data science, all you have is historical data in an “as-it-is” state, with no ability to use it to predict and plan. That ability comes through investment in data science. Without it, companies will always trail competitors who can make data-driven decisions.

A thriving data science practice also grounds the entire organization in one rock-solid point of truth for data. Data science gives you more options, tools and overall firepower, so not investing in data science is truly a losing proposition for anyone.

What are some of the challenges to starting that process they might not be aware of?

MS: Building a data science pipeline is not an easy process, but with some forethought many challenges can be avoided. The first key is to take the time to plan and consider your resources. Often a data scientist will want to investigate something, so they will get access to some data, analyze it and circulate some derived intelligence around the company. As more people want more, deeper analysis, they run into issues because the process is not scalable. It really is crucial to consider both infrastructure and end goals when developing a data science practice.

Team makeup is another key consideration that can help avoid challenges along the way. For example, there are generally three types of data scientists. Some have a strong background in software engineering, while others have a completely mathematical or statistical background. If the team is not properly constructed based on skillsets, they may run into difficulties when the process requires advanced coding, for example. It is vital that both data science and engineering teams be built in a balanced fashion so they can readily handle changing circumstances and demands.

Are there particular industries that benefit the most from a strong data science practice?

MS: It would be easier to name industries that wouldn’t benefit, but really any industry that relies on consumer demand would be well served to fully embrace data science. Whether it’s telecommunications, retail, finance, consumer packaged goods, the potential is massive. All those industries rely on and thrive when driven by consumer data. The use of this data to drive crucial decisions is the reason we invest in data science.

What is an example of a cool or impactful thing you have seen come from data science?

MS: I can think of a couple from my time working with Coca-Cola. The time I joined was an interesting time in Coke history, where people were worrying about the side effects of aspartame and becoming more health conscious in general. The soda business was certainly feeling those effects, and had been shrinking for some time, especially in the U.S.

I joined to help improve Coke’s data science practice and maximize its impact. Coke has an interesting challenge when it comes to customer data. Since Coke’s products are purchased mainly through grocery and convenience stores, it can be difficult to know enough about individual customers to effectively market to them, even at a regional level. Supermarkets and convenience stores share sales information with Coke, but they do not have the ability to share data on the customers doing the purchasing.

One way we began to collect data at the customer level is with the introduction of the Coca-Cola Freestyle machine. If you’re not familiar with it, the Freestyle is a soda fountain that allows customers to choose from any combination of 165 flavors of drinks, mixing them any way they like. The Freestyle is all about collecting customer preference data, and it has been a huge success. I included customer preference from the 20 million customers using the Freestyle machines as one of 30 different data sources that populated our data lake. Using the correlations we were able to determine through our work, we helped drive the addition of two new soda brands to the company’s portfolio, as well as determine the optimal regions to launch the new products.

What was the other one?

MS: It was around supply chain and inventory management. Coke had long had access to advanced demographic, financial and sales information, but the source of much of the data was external, so it wasn’t always exactly what we needed or in the format we would’ve liked. It was also a significant cost center. Once we had the data flowing into the data lake, though, we were able to eliminate that reliance on third-party data vendors and perform much more efficient, impactful data science. We were able to go beyond the basic daily sales numbers and provide reasons behind the performance, as well as recommendations to improve it. We derived recommendations in a number of ways, but one of the most interesting was by using natural language processing (NLP) to mine social media data to find interesting correlations. The impact was immediate and profound. Coke began to change many of their fulfilment methods and tactics to select, optimize and restock their products. This really brought the value of data science into focus, and soon Coke began partnering with supermarkets to add certain sensors to capture pictures and video, further expanding their available universe of customer data.

These are the kinds of advancements and capabilities that a strong data science practice can bring to your organization.

Blueprint Technologies is Here to Help

Blueprint Technologies is here to help you establish or improve your data science and analytics function and unlock the true power and value of your data. 

Share with your network

You may also enjoy

Classic vs. Serverless: Exploring Databricks’ latest Innovations

Explore the benefits of Databricks’ serverless solutions, which simplify resource management, improve productivity, and optimize costs. Discover key insights and best practices to enhance your data strategy with cutting-edge serverless technologies.

Help for FinOps Leaders – How the Lakehouse Optimizer can assist with your Lakehouse 

Discover how FinOps leaders manage cloud and data costs effectively while maximizing business value. Learn how the Lakehouse Optimizer (LHO) addresses common business problems through discovery, optimization, and operation.
Blueprint Technologies - Data information specialists

What we do

  • Generative AI
  • Video analytics
  • Application development
  • Cloud and infrastructure
  • Data platform modernization
  • Data governance
  • Data management
  • Data science and analytics
  • TCO Planning 
  • Productization
  • Future Proofing
  • Intelligent SOP
  • Lakehouse Optimization
  • Data Migrations
  • Generative AI
  • Video analytics
  • Application development
  • Cloud and infrastructure
  • Data platform modernization
  • Data governance
  • Data management
  • Data science and analytics
  • TCO Planning 
  • Productization
  • Future Proofing
  • Intelligent SOP
  • Lakehouse Optimization
  • Data Migrations

Industries

  • Manufacturing
  • Retail
  • Health & Life Sciences
  • Financial Services
  • Manufacturing
  • Retail
  • Health & Life Sciences
  • Financial Services

Databricks

  • Databricks Center of Excellence
  • QuickStart Offerings
  • Accelerated Data Migration
  • Accelerated Unity Catalog Migration
  • The Lakehouse Optimizer
  • Accelerated Snowflake to Databricks Migration
  • Databricks Center of Excellence
  • QuickStart Offerings
  • Accelerated Data Migration
  • Accelerated Unity Catalog Migration
  • The Lakehouse Optimizer
  • Accelerated Snowflake to Databricks Migration

About

  • Our approach
  • News
  • Events
  • Partners
  • Careers
  • Our approach
  • News
  • Events
  • Partners
  • Careers

Insights

Our work

Support

Contact us

Linkedin Youtube Facebook Instagram

© 2024 Blueprint Technologies, LLC.
2600 116th Avenue Northeast, First Floor
Bellevue, WA 98004

All rights reserved.

Media Kit

Employer Health Plan

Privacy Notice