The Advanced Semiconductor Supply Chain Dataset includes manually compiled, high-level information about the tools, materials, processes, countries, and firms involved in the production of advanced logic chips. The current version of the dataset reflects how researchers understood this supply chain in early 2021. It uses a wide variety of sources, such as corporate websites and disclosures, specialized market research, and industry group publications.
The dataset csv files are available on Github.
You can use this dataset to:
Most of the data is taken from the CSET report The Semiconductor Supply Chain: Assessing National Competitiveness, published in 2021. In 2022, ETO researchers augmented the company profiles in the data with information manually collected from producer websites and the open internet. Read more >>
Please cite the "Emerging Technology Observatory Advanced Semiconductor Supply Chain Dataset (2022 release)," including the link. If you use the explorer tool to access the data, you can cite that tool instead.
The dataset consists of five csv tables: inputs, providers, provision, sequence, and stages.
This table includes basic information about inputs to advanced chip production. Inputs include processes, tools, and materials. Material inputs are consumed in the production process (e.g. photoresist, wafers); tools are durable (e.g. photolithography equipment).
|input_name||text||The name of the input.|
|input_id||text (ID)||A unique alphanumeric identifier for the provider.|
|type||text||Whether the input is a process, tool, or material input.|
|stage_name||text||The name of the production stage to which the input belongs. For inputs of type process only.|
|stage_id||text (ID)||For inputs of type process only, indicates the ID of the production stage in which the process takes place. Connects to the stages table.|
|description||text||A short narrative summary of the input and its significance. All summaries are adapted from the CSET report The Semiconductor Supply Chain: Assessing National Competitiveness.|
This table lists countries and firms that provide inputs to advanced chips.
|provider_name||text||The name of the provider. Countries are identified with their three-digit ISO codes (ISO 3166). Here (and generally in ETO resources) we use "country" informally, as a shorthand term for sovereign countries, independent states, and certain other geographic entities. Read more >>|
|provider_id||text (ID)||A unique alphanumeric identifier for the provider.|
|provider_type||text||Whether the provider is a country or an organization.|
|provider_name||text (ISO 3166)||For providers of type organization, indicates the country in which the organization is headquartered.|
This table describes the specific inputs provided by each country and firm, presented as provider-input pairs.
|provider_name||text||The name of a provider.|
|provider_id||text (ID)||The unique identifier of the provider. Connects to the providers table.|
|provided_name||text||The name of an input provided by the provider.|
|provided_id||text (ID)||The unique identifier of an input provided by the provider. Connects to the inputs table.|
|share_provided||percentage||The provider's market share for the specified input. This figure is generally available for countries, rather than firms, and refers in that case to the collective market share of all firms headquartered in that country. (In some cases, a provider country will not have a share_provided value for a particular input, reflecting limitations in the underlying dataset.)|
|negligible_market_share||text||Whether the provider accounts for a negligible share of the global market in the specified input. This field will only be populated if share_provided is empty (but will not be populated in every such instance). A company will be designated a negligible provider of a given input if (a) its headquarters country has less than 2% or less of the global market share for that input or (b) the provider accounts for a negligible share of the global market in the specified input, as indicated by the CSET report The Semiconductor Supply Chain: Assessing National Competitiveness.|
This table describes the relationships between different inputs. There are two types of relationship described: inputs that "go into" other resources (e.g., in the case of a material that is used in a process, or a process that occurs directly before another process), and inputs that are specific subtypes of other defined inputs (e.g., EUV photolithography machines are designated as a type of photolithography equipment).
|input_name||text||The name of an input.|
|input_id||text (ID)||The unique identifier of the input. Connects to the inputs table.|
|goes_into_name||text||The name of another input into which the initial input is incorporated or otherwise connected.|
|goes_into_id||text (ID)||The unique identifier of the input identified in goes_into_name. If this field is populated, then is_type_of_id will not be populated. Connects to the inputs table.|
|is_type_of_name||text||If the initial input is a sub-type of another kind of input, the name of the second input will be listed here.|
|is_type_of_id||text (ID)||The unique identifier of the input identified in is_type_of_name. If this field is populated, then goes_into_id will not be populated. Connects to the inputs table.|
This table describes different stages of the production process for advanced chips.
|stage_name||text||The canonical name of the stage.|
|stage_id||text (ID)||A unique alphanumeric identifier for the stage.|
|description||text||A short narrative summary of the stage and its significance. All summaries are adapted from the CSET report The Semiconductor Supply Chain: Assessing National Competitiveness.|
Almost all data are taken from the CSET report The Semiconductor Supply Chain: Assessing National Competitiveness, which was published in 2021. Information in that report was gathered manually from a wide variety of sources and was generally current as of 2019. Refer to the relevant sections and footnotes in the paper for specific sourcing. Certain company names and countries were updated by ETO researchers in 2022.
All data from the 2021 CSET report were manually extracted from the report and entered into this dataset. Updated company information was manually collected by ETO researchers from credible open sources, including company websites, trade publications, and major news outlets. For about 75% of companies, the information was independently reviewed for sourcing and accuracy in collection by at least one other researcher.
The dataset csv files are available on Github.
Learn about how and where advanced logic chips are produced and the tools, materials, and processes that are involved. You can read directly from the raw data or use our Explorer tool to browse visually.
Assess countries' and companies' role in the supply chain using the dataset's extensive provider information. If you have a specific input in mind, you can open it in the Explorer tool to quickly view associated countries (usually with per-country market share) and firms. More complex queries can be performed on the raw data using your favorite analysis tool.
Identify "chokepoints," market concentration, dependency relationships, and other structural features of the supply chain. You can use the Explorer's market concentration filter as an entry point here. More complex structural characteristics can be browsed visually with the Explorer tool or defined systematically with other tools.
Because this dataset is collected manually by analysts, updating it takes significant work and time. We plan to periodically release new, comprehensively updated versions annually at most. Older versions will remain accessible on this page and in Github. The next update is not yet scheduled.
Between these major updates, there may be minor revisions to individual data points based on user feedback. These revisions will be logged on the Github pages for the relevant tables.
Use our general issue reporting form. Or, if you access the dataset through the Supply Chain Explorer, you can submit issue reports for specific fields or data points using the "Report an Issue" links embedded in the tool. Read more >>
This dataset is based on Saif M. Khan, Alexander Mann, and Dahlia Peterson, The Semiconductor Supply Chain: Assessing National Competitiveness (Center for Security and Emerging Technology, January 2021).
Additional support came from: