Blueprint Technologies - Data information specialists

I feel the need. The need for speed…dee data

By Gary Nakanelua

As managing director of Innovation at Blueprint Technologies, I have the pleasure of working directly with some of the most talented data scientists in the world, both within our company and through our various partners. A common theme I have found in projects involving data science is the need for significant amounts of data.

Recently, we worked with our largest partner, Microsoft, on a video analytics project. It was an incredible opportunity to experiment with Azure for video processing and analysis. The case study for this project will be published soon, so rather than detail the solution, I’ll cover a problem we had to overcome early in the project: the availability of relevant video data.

We had 60 days to go from whiteboard to market with a video analytics solution for a specific use case within a specific industry. We needed overhead video footage of people and vehicles within a city environment. After a bit of Google-Fu, we found quite a few overhead static imagery datasets, but we needed video. The few video datasets we did find lacked the desired consistency. We had to figure out something different. Quickly.

We experimented with generating the video we needed using drones. The approach lacked the traffic density we needed.

Attempts to capture footage of live traffic resulted in warnings from local law enforcement about the use of civilian drones in high-traffic areas. It was time to try something different. Or get arrested.

Previously, we had success generating training data for machine learning models using video games. In fact, at the Apache Spark + AI Summit a few years ago, we presented our research in training collision detection for an autonomous drone experiment using Doom.

Because it lets you build a world to fit your needs, we originally intended to use Minecraft. In 2014, Microsoft acquired Mojang, the game studio that created Minecraft. Two years later, Microsoft publicly unveiled Project Malmo, “a sophisticated AI experimentation platform built on top of Minecraft, and designed to support fundamental research in artificial intelligence.” However, Minecraft lacked the vehicles and associated driving behaviors we needed for the project.

We were introduced to AirSim, an open source simulator for autonomous vehicles from Microsoft AI & Research, built on the Unreal Engine. Based on the demos, it appeared to have everything we needed to generate our video data. However, building AirSim on a MacBook was taking more time than anticipated. The documentation did note that “It should be possible to build AirSim on OSX as well, but it isn’t actively tested.” Yet again, we had to find a different way.

The solution to our problem turned out to be one of the most ambitious simulations of a city available: Grand Theft Auto V. It was created by Rockstar North and the studio took great care in attempting to recreate Los Angeles (Los Santos as it is referred to in the game). The studio sent out multiple research teams throughout Los Angeles and shot over 250,000 images and hours of video. From the Los Angeles International Airport and Beverly Hills to landmarks such as the Hollywood sign and the Griffith Observatory, Grand Theft Auto V had all the elements we needed to generate our video data.

The game includes a director mode, which allowed us to control traffic density, pedestrian population, time of day, weather, and camera angle. Camera control would prove to be the most beneficial as our early attempts in the game started with hovering over a particular area of the city in a helicopter.
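Because each recording session has to be set up by hand, it can help to enumerate the combinations of settings ahead of time. The following is a minimal sketch of such a capture checklist, not something from the project itself: the setting names come from the paragraph above, while the specific values (for example "low", "dusk", "oblique") and the `build_capture_plan` helper are hypothetical placeholders.

```python
from itertools import product

# Hypothetical levels for each control; the article names the controls
# but does not say which values were actually used.
SETTINGS = {
    "traffic_density": ["low", "medium", "high"],
    "pedestrian_population": ["low", "medium", "high"],
    "time_of_day": ["morning", "noon", "dusk", "night"],
    "weather": ["clear", "rain", "fog"],
    "camera_angle": ["overhead", "oblique"],
}


def build_capture_plan(settings):
    """Return one dict per combination of setting values, in a stable order."""
    names = list(settings)
    value_lists = [settings[name] for name in names]
    return [dict(zip(names, combo)) for combo in product(*value_lists)]


plan = build_capture_plan(SETTINGS)
print(f"{len(plan)} recording sessions to capture")
print(plan[0])
```

A full grid grows quickly, so in practice one would likely trim it to the combinations that matter for the use case before sitting down to record.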

This approach saved weeks of time. We avoided having to program traffic simulations and randomization patterns. It provided high enough fidelity that we avoided having to travel to physical locations to film the video footage we needed (and avoided getting arrested). It provided the flexibility necessary to generate hours’ worth of relevant training and testing data. Using this data, we were able to train various algorithms for object identification and tracking. In addition, the video data is used to train activity similarity models and improve the overall accuracy of the models.

Although using the game to train machine learning models may not be what the designers had in mind, the approach proved to be a quick and efficient way of generating pedestrian and vehicles-in-motion activity. Unfortunately, we didn’t have programmatic access to the game (we used an Xbox), so we had to actually play the game to get the locations and activity we wanted. The sacrifices we make in the name of data science…

© 2024 Blueprint Technologies, LLC.
2600 116th Avenue Northeast, First Floor
Bellevue, WA 98004

All rights reserved.
