skip to Main Content
Data Collection

The Opportunity and Challenges of Vehicle Data

Apr 10, 2024

In our countless conversations with customers, one thing continues to be clear: the increased opportunities from vehicle data will be a game changer in the years ahead for car manufacturers. Collecting and analyzing data delivers not just one but rather a wide range of benefits spanning from diagnostics, planning, driver services, safety, downstream revenue and much more.

Yet, as the industry shifts towards software-defined vehicles (SDVs), there are challenges. Handling the vast amount of vehicle data is a key challenge that car manufacturers face when managing vehicle data collection. In addition, data security and privacy of driver data, vehicle location, and driving data must be protected. Based on customer conversations, I believe there are several critical aspects of aspects of these opportunities and challenges around vehicle data that need to be addressed.

Enabling continuous innovation is important

One of the promises of SDVs is the ability to react to changing vehicle needs that evolve over time. When discussing this topic with OEMs and tier-1 suppliers, some of them believe that enabling Over The Air (OTA) software updates is all that’s required to enable such evolution. But, it’s much more complicated than that.

First, unless vehicles are built on a flexible foundation that enables reconfiguration of vehicle capabilities over time, then the ability to adapt to later requirements will be limited. Second, vehicle software updates are difficult and time-consuming, requiring planning and executing the software development, complex validation, and only then subsequent OTA. So while updating vehicle software via OTA is an important aspect of SDVs, it does not enable the kind of dynamic responsiveness needed to respond to emergent needs of the car manufacturer.

Consider for example an emergency braking system (soon to be a mandatory element of advanced driver assistance systems) that is not operating properly. This year at CES 2024 we demonstrated, alongside leading cloud supplier AWS, how adaptive data collection coupled with rich real-time cloud data analytics can help identify the problem and improve safety in hours or days. Once diagnosed, the ultimate solution to the problem might be via an OTA update, but waiting weeks or months for an OTA just to diagnose the problem would inject unacceptable time that could lead to liability, loss of reputation, or injury.

To learn more and to see this demo of data collection with AWS cloud processing check out these resources:

Improving operational efficiency means cutting across OEM silos

As data becomes more critical, it is essential to handle it efficiently and ensure reuse within the OEM and their downstream providers. In many vehicles on the road today that do leverage vehicle data, it is frequently the case that data from different subsystems such as ADAS, body, powertrain, and battery are sent narrowly to their respective owning groups within the OEM. On its surface, this may seem a sensible division of labor that gets the respective groups the data they need to analyze their subsystems and improve them. Or is it?

The reality is that broader contextual awareness of how the vehicle is being driven is important, and the inter-dependencies between systems is growing. Siloed data ends up causing two significant problems, each as pernicious as the other. First, creating multiple smaller data sets or individual subsystems prevents holistic analytics that may be used for superior decision-making. Understanding driver behavior and vehicle performance outside of a siloed subsystem enables better optimization and a stronger result. Second, there is a massive inefficiency associated with collecting different data streams per subsystem. In some vehicles, there are multiple parallel pipes to the cloud, which increases the cost and complexity of handling data.

Instead of creating multiple disconnected single-purpose “data ponds” from a vehicle, our goal should be to create an intelligent “data lake” that can be accessed as a single source of truth from many groups across an OEM. Some leading OEMs such as Hyundai and BMW are embracing this trend, but many others are slow to shift to this richer approach.

Check out these episodes from The Garage podcast to learn more on this trend:

Optimizing upload, storage, and processing in the cloud is essential

Most OEMs are rapidly embracing the importance of cloud processing to solve a range of use cases. However it is not yet universally understood how to do so efficiently. Initial thinking might be to view the cloud as an infinite compute and boundless storage resource, but practical cost considerations quickly come to bear. Starting with LTE upload, high volumes of data quickly rack up an expensive wireless bill that can be prohibitive. Of course, upload could be relegated to WiFi only, but with cars only intermittently connected to WiFi, this greatly limits the power of vehicle data. The fact is that some data needs to be uploaded over cellular networks to be useful, and so most cars have one or more LTE connections back to the OEM.

The cost considerations continue in the cloud. As discussed earlier, the benefits of creating a data lake instead of a data pond are indeed compelling. However, uploading data that will never be used will incur cloud storage costs and can become quite expensive. Finally, there is the question of cloud processing power needed to analyze data. If we start with a pile of unneeded data, it will require a massive amount of computation to look for the needle in the haystack. A far better approach is to strategically identify valuable types of data and targeted situations warranting data capture.

For example, it’s far more useful to capture high-resolution diagnostic data near the reporting of a DTC (Diagnostic Trouble Codes) fault code to provide more information for later analysis than it is to continuously collect data when things are going well. Similarly, if there is an indication of a potential recall, it may be important to capture more data than usual but turn that data capture off once the situation is understood or resolved. The ability to dynamically optimize the use of capture, cloud upload, storage and processing, are paramount to achieving an acceptable ROI on data.

Protecting privacy and sensitive personal information

Alongside all these opportunities and challenges, ensuring data privacy while collecting data and storing vehicle data is paramount. Privacy laws are rapidly expanding: in Europe, GDPR (General Data Privacy Regulation) and in the US, state regulations like the California Consumer Privacy Act both mandate that data collection and sensitive personal information including gps data, vehicle history, driving habits, and other data points be protected. At the same time, drivers of modern vehicles desire the ability to make informed decisions about sharing their data on vehicle operation, such as with insurance carriers for superior rates for safe drivers. Most drivers are open to sharing some data about driving behavior if insurers offer discounts, but drivers need to be in control.

Techniques being considered by car manufacturers like anonymizing sensitive data points like the vehicle identification number, vehicle’s location and similar vehicle data can potentially enable vehicle history reports that can improve vehicle quality, promote automotive innovation, improve vehicle performance and advance driving safety — all while protecting privacy of such data.

Exploring these topics in more detail

The opportunity for vehicle data is significant, but the challenges cannot be ignored and many aspects that need to be considered in order to find the right solution. At Sonatus, we are working directly to solve these issues. Sonatus Collector allows car manufacturers to address these challenges by instantly gathering precise vehicle data using lightweight collection policies to improve operational efficiency and further drive continuous vehicle innovations. At the same time, the data collection for all this data can be done

In the coming months, we will be exploring all of these topics more deeply and through different lenses to shed light on potential solutions and highlight how industry leaders are tackling them with success. I hope you will join us as we explore these topics in more detail.

Back To Top