The Advanced Semiconductor Supply Chain Dataset includes manually compiled, high-level information about the tools, materials, processes, countries, and firms involved in the production of advanced logic chips. The current version of the dataset reflects how CSET researchers understood this supply chain in mid-2025, drawing on industry data and publicly available analyses.
The dataset csv files are available on Github.
You can use this dataset to:
Most of the data was adapted by CSET researchers from industry data provided by TechInsights. We augmented the TechInsights data with other publicly available analyses and information from prior CSET research. Read more >>
No.
This dataset is subject to ETO's general terms of use. If you use it, please cite us.
Please cite the "Emerging Technology Observatory Advanced Semiconductor Supply Chain Dataset (2025 release)," including the link. If you use the explorer tool to access the data, you can cite that tool instead.
The dataset consists of five csv tables: inputs, providers, provision, sequence, and stages.
This table includes basic information about inputs to advanced chip production. Inputs include processes, tools, and materials. Material inputs are consumed in the production process (e.g. photoresist, wafers); tools are durable (e.g. photolithography equipment).
Column name | Type | Description |
input_id | text (ID) | A unique alphanumeric identifier for the input. |
input_name | text | The name of the input. |
type | text | Whether the input is a process, tool, design, or material input. |
stage_name | text | The name of the production stage to which the input belongs. For inputs of type process only. |
stage_id | text (ID) | For inputs of type process only, indicates the ID of the production stage in which the process takes place. Connects to the stages table. |
description | text | A short narrative summary of the input and its significance. Written by CSET researchers. Many summaries are adapted from the CSET report The Semiconductor Supply Chain: Assessing National Competitiveness. |
year | year | The year for which market size and/or market share data is provided for this input. |
market_share_chart_global_market_size_info | text | Total global revenue from sales of the input for the year specified in year. |
market_share_chart_caption | text | A caption for the market share charts displayed for this input. |
market_share_source | text | The source of the market size and share data provided for this input. |
This table lists countries and firms that provide inputs to advanced chips. A provider may be listed multiple times if it has more than one alias.
Column name | Type | Description |
provider_name | text | The name of the provider. Countries are identified with their three-digit ISO codes (ISO 3166). Here (and generally in ETO resources) we use "country" informally, as a shorthand term for sovereign countries, independent states, and certain other geographic entities. Read more >> |
alias | text | Another name for the provider. |
provider_id | text (ID) | A unique alphanumeric identifier for the provider. |
provider_type | text | Whether the provider is a country or an organization. |
country | text (ISO 3166) | For providers of type organization, indicates the country in which the organization is headquartered. |
This table describes the specific inputs provided by each country and firm, presented as provider-input pairs.
Column name | Type | Description |
provider_name | text | The name of a provider. |
provider_id | text (ID) | The unique identifier of the provider. Connects to the providers table. |
provided_name | text | The name of an input provided by the provider. |
provided_id | text (ID) | The unique identifier of an input provided by the provider. Connects to the inputs table. |
share_provided | percentage | The provider's market share for the specified input in a given year. This figure is generally available for countries, rather than firms, and refers in that case to the collective market share of all firms headquartered in that country. (In some cases, a provider country will not have a share_provided value for a particular input, reflecting limitations in the underlying dataset.) |
year | text | The year in which the provider specified in provider_name had the market share percentage specified in share_provided. |
source | text | The source of the data provided for each provider-input pair. |
This table describes the relationships between different inputs. There are two types of relationship described: inputs that "go into" other resources (e.g., in the case of a material that is used in a process, or a process that occurs directly before another process), and inputs that are specific subtypes of other defined inputs (e.g., EUV photolithography machines are designated as a type of photolithography equipment).
Column name | Type | Description |
input_name | text | The name of an input. |
input_id | text (ID) | The unique identifier of the input. Connects to the inputs table. |
goes_into_name | text | The name of another input into which the initial input is incorporated or otherwise connected. |
goes_into_id | text (ID) | The unique identifier of the input identified in goes_into_name. If this field is populated, then is_type_of_id will not be populated. Connects to the inputs table. |
is_type_of_name | text | If the initial input is a sub-type of another kind of input, the name of the second input will be listed here. |
is_type_of_id | text (ID) | The unique identifier of the input identified in is_type_of_name. If this field is populated, then goes_into_id will not be populated. Connects to the inputs table. |
This table describes different stages of the production process for advanced chips.
Column name | Type | Description |
stage_name | text | The canonical name of the stage. |
stage_id | text (ID) | A unique alphanumeric identifier for the stage. |
description | text | A short narrative summary of the stage and its significance. All summaries are adapted from the CSET report The Semiconductor Supply Chain: Assessing National Competitiveness. |
Unless otherwise specified, data in the Advanced Semiconductor Supply Chain Dataset is derived by CSET analysts from the TechInsights Chip Market Research Services (CMRS) Semiconductor Equipment Database (May 2025 release). The CMRS equipment database includes company-level revenue data for various semiconductor industry inputs organized hierarchically (e.g., EUV lithography tools are organized under lithography tools). CSET analysts mapped the revenue data for different inputs in the CMRS market segmentation to the most closely related inputs in the Advanced Semiconductor Supply Chain Dataset, in some cases mapping multiple CMRS inputs to a single one in our dataset. Based on that mapping, we populated the provision table as follows:
Unless otherwise specified, market size figures in the inputs table are the sum of revenues across all companies in the CMRS database in the year specified.
Companies that had no more than 1% market share for every input in our dataset were not assigned a country affiliation. Therefore, the market share of countries with many tiny suppliers would be underrepresented in our dataset, however, we have no specific reason to suspect there are cases like this.
The dataset csv files are available on Github.
Learn about how and where advanced logic chips are produced and the tools, materials, and processes that are involved. You can read directly from the raw data or use our Explorer tool to browse visually.
Assess countries' and companies' role in the supply chain using the dataset's extensive provider information. If you have a specific input in mind, you can open it in the Explorer tool to quickly view associated countries (usually with per-country market share) and firms. More complex queries can be performed on the raw data using your favorite analysis tool.
Identify "chokepoints," market concentration, dependency relationships, and other structural features of the supply chain. You can use the Explorer's market concentration filter as an entry point here. More complex structural characteristics can be browsed visually with the Explorer tool or defined systematically with other tools.
Because a substantial part of this dataset is collected manually by analysts, updating it takes significant work and time. We plan to periodically release new, comprehensively updated versions annually at most. Older versions will remain accessible on this page and in Github. The next update is not yet scheduled.
Between these major updates, there may be minor revisions to individual data points based on user feedback. These revisions will be logged on the Github pages for the relevant tables.
Use our general issue reporting form. Or, if you access the dataset through the Supply Chain Explorer, you can submit issue reports for specific fields or data points using the "Report an Issue" links embedded in the tool. Read more >>
Much of the data in the Advanced Semiconductor Supply Chain Dataset is derived by CSET analysts from the TechInsights Chip Market Research Services (CMRS) Semiconductor Equipment Database (May 2025 release). The dataset also incorporates data published by World Semiconductor Trade Statistics (WSTS) and the Semiconductor Industry Association (SIA), among other sources.
Prior releases of the dataset drew on data from Saif M. Khan, Alexander Mann, and Dahlia Peterson, The Semiconductor Supply Chain: Assessing National Competitiveness (Center for Security and Emerging Technology, January 2021).
Additional support came from:
7/14/25 | July 2025 update: updated most data from 2019 to 2024, revised input taxonomy, and updated country affiliations. |
10/13/22 | Initial release |