How Space-Based Data Will Drive the Digital Economy
January 14th, 2025The space industry is undergoing rapid transformation as new reusable rockets and artificial intelligence tools for optimizing and enhancing space networks emerge. Alongside integrating these emerging technologies, which signify an inevitable future of edge-computing satellites on the horizon, the government and commercial sectors are showing a growing appetite for one of the most important commodities in our digital economy: space-based data.
The State of Space-based Data
According to ABI Research’s latest release of Satellite Constellations and Launch 2024, there are over 10,000 active satellites in orbit, over 900 of which are active Earth Observation (EO) satellites (optical, radar, meteorology). By 2032, this number is expected to grow to more than 2,300 EO satellites. With some industry estimates indicating that each EO satellite generates approximately 100 terabytes (TB) of data daily, this would collectively amount to around 230 petabytes (PB) of data per day by 2032.
Enhancing this large volume of EO data is an ever-growing suite of powerful AI models and tools, often developed and trained by EO operators themselves and by third-party data analytic firms, to help accelerate productivity and enhance analysis. And, unlike AI pure-play companies, they have more control of their training data. In this way, distinct clusters of AI companies, those focused on internet-trained AI data — large language models (LLMS) — and others focused on space-based unlabeled data — foundation models (FM) are set to evolve.
The Impact Potential of Space-based Data
While space-based data has found use for applications within environmental monitoring, agriculture, military and defense, and finance there is much potential beyond predicting natural disasters or the movements of adversaries.
Map View Data for Autonomous Systems: Autonomous vehicles, such as Tesla, use a range of sophisticated sensors on board to provide a comprehensive view of their surroundings. Some autonomous vehicles even include LiDAR to help generate 3D maps of the environment like some EO systems currently do. However, these integrated sensors and cameras are typically limited by range (around 250 meters) and can be limited by lighting and weather conditions.
In space, however, the range of such systems can be significantly expanded, and architecture enables flexible data routing around storms. By leveraging FMs with a persistent view of dedicated locations on Earth, EO satellites could provide critical geospatial data analysis for autonomous vehicles, effectively extending the vehicle’s data perception range, and aiding them in navigation, prediction, and even real-time decision-making.
This synergy of sensors and AI models on Earth and space could also be applied to aerial vehicles, such as drones or air taxis, and provide advantages over radio simultaneous localization and mapping (SLAM) or radar, especially in more dense urban areas that affect the waveform of these technologies.
Support for Self-healing Networks: Communication service providers (CSPs) and telcos can also leverage space-based data to enhance the predictive analysis and response times to critical outages of the network. As networks become increasingly complex and gain the ability to detect, diagnose, and repair the network autonomously, the need for more data to identify patterns and potential problems before they occur is paramount.
In this scenario, space-based data can help support advanced self-healing capabilities such as AI-driven anomaly detection and repair of deployment of ad hoc connectivity infrastructure, such as High-Altitude Platform Stations (HAPS), for network infrastructure that is offline due to cyberattacks or inclement weather. In these ways, space-based data can add a critical analytical support element to network resiliency efforts and help maintain network SLAs.
Evolution of FM and LLMs parameters: Major potential customers of space-based data are other AI models, particularly LLM companies such as OpenAI or Anthropic, who could use this data to improve their model performance and generalization. While this can help train image recognition models to learn more diverse patterns and scenarios and provide more robust predictions of real-world events, it also unlocks the potential for faster and higher-quality responses to geospatial data.
From this perspective, future models trained on and analyzing multi-dimensional and even multi-modal data can synthesize and provide insights into complex events occurring online and in the physical world. Much like the 3GPP’s non-terrestrial network (NTN) standard seeks to integrate terrestrial and non-terrestrial network infrastructure, the evolution of AI models will inevitably integrate data from terrestrial and non-terrestrial sources.
Where to Go from Here?
Many organizations depend on satellite data for decision-making, but transmitting critical space-based data is often restricted to times when satellites pass over ground stations. In this way, true real-time EO satellite data is challenging to achieve, however, some systems are coming very close to providing near real-time data, such as Iceye, Plant Labs, or Maxar Technologies (with image collection every 20 to 30 minutes). To overcome time restriction challenges in orbit, many companies are beginning to adopt edge computing, processing data directly on the satellite, to help reduce the need to send large volumes of raw data back to Earth, optimizing the limited time window provided for analysis.
While edge computing and satellite adoption are still early days, mass deployment of EO satellites for continuous coverage or integration into a larger mesh architecture may help provide customers with the continuous stream of data they may need. As satellite technology continues to improve and become more accessible, there will inevitably be greater integration of space-based data into our digital infrastructure.
This convergence of space technology and digital capabilities will not only enhance existing services but also unlock innovative solutions to global challenges and ultimately contribute to a more connected, efficient, and sustainable digital future. Indeed, with the projected synergies with AI and ML, ABI Research estimates revenue generated from the sale of EO data will increase to over $ 6 billion by 2028, reflecting a 2023-2028 compound annual growth rate (CAGR) of 15 percent. VS
Andrew Cavalier is a senior analyst for ABI Research
Lead photo is a Via Satellite archive photo