Today, we're highlighting an ETO resource you might not have noticed before: our dataset portal, which went live on our website earlier this fall. ETO's web-based tools offer data-driven insight for users of all backgrounds - no data skills required. However, we know some of our users want to dig deeper into the data than our tools allow. If you're one of those power users, our new portal provides open access to many ETO datasets - so you can work with the data your way.
Not all ETO data are currently available through the portal, but we're working to open more of our original datasets to the public. As of today, the public datasets include:
- Country AI Activity Metrics: National-level metrics for research, patents, and private-market investment in AI and its subfields.
- Cross-Border Tech Research Metrics: Metrics for cross-border research in emerging technology domains, such as AI, robotics, and cybersecurity.
- Private-Sector AI Indicators: Diverse indicators of AI-related activity for hundreds of companies worldwide, from startups to multinationals.
- Advanced Semiconductor Supply Chain Dataset: Manually compiled, high-level data about the tools, materials, processes, countries, and firms involved in the production of advanced logic chips.
- AGORA Dataset: An extensively annotated, frequently updated collection of AI-relevant laws, regulations, standards, and other governance documents.
- ETO OpenAlex Overlay: CSET’s original language labels and emerging-tech subject classifications for OpenAlex works.
All of these datasets are available today at https://eto.tech/datasets/. Each comes with detailed documentation, including access instructions, full schemas, use cases, methodology, and limitations.
We plan to release more data in 2025, and we'd like to prioritize the data our users would find most helpful. If there's emerging tech data that would make a difference in your work, please let us know!
In the meantime, please feel free to contact us with any questions about ETO data - or drop by for live support during our office hours. We'll be glad to help. 🤖