Table of Contents
Fetching ...

Life cycle assessment for all organic chemicals

Shaohan Chen, Tim Langhorst, Julian Nöhl, Christopher Oberschelp, Martin Pillich, Johannes Schilling, André Bardow

Abstract

Chemicals are embedded in nearly every aspect of modern society, yet their production poses substantial sustainability concerns. Achieving a sustainable chemical industry requires detailed Life Cycle Assessment (LCA); however, current assessments face many unknowns due to limited, partly inconsistent, and untransparent data coverage since existing Life Cycle Inventory (LCI) databases account for only a tiny fraction of traded chemicals. Here, we introduce the Chemical RetrosYnthesiS for Transparent Assessment of Life-cycles (CRYSTAL) framework, which automatically generates consistent and transparent LCI data for organic chemicals based on their molecular structure using retrosynthesis and machine-learned gate-to-gate inventories. Using the predictive power of CRYSTAL, we create a consistent database for more than 70000 organic chemicals, comprising over 110000 transparent LCI datasets that quantify both feedstock and energy demands, together with associated auxiliary materials, biosphere flows, and waste flows. From this comprehensive database, we identify 50 key environmental hotspots driving high impacts of organic chemical production across multiple environmental categories and pivotal hub chemicals that are most critical for downstream chemical production. In providing this comprehensive data foundation, the CRYSTAL framework offers systematic guidance for targeted engineering and policy interventions. Its transparent, modular nature is designed to shift chemical LCA from a reliance on "unknown unknowns" to a collaboratively improvable mapping of "known unknowns".

Life cycle assessment for all organic chemicals

Abstract

Chemicals are embedded in nearly every aspect of modern society, yet their production poses substantial sustainability concerns. Achieving a sustainable chemical industry requires detailed Life Cycle Assessment (LCA); however, current assessments face many unknowns due to limited, partly inconsistent, and untransparent data coverage since existing Life Cycle Inventory (LCI) databases account for only a tiny fraction of traded chemicals. Here, we introduce the Chemical RetrosYnthesiS for Transparent Assessment of Life-cycles (CRYSTAL) framework, which automatically generates consistent and transparent LCI data for organic chemicals based on their molecular structure using retrosynthesis and machine-learned gate-to-gate inventories. Using the predictive power of CRYSTAL, we create a consistent database for more than 70000 organic chemicals, comprising over 110000 transparent LCI datasets that quantify both feedstock and energy demands, together with associated auxiliary materials, biosphere flows, and waste flows. From this comprehensive database, we identify 50 key environmental hotspots driving high impacts of organic chemical production across multiple environmental categories and pivotal hub chemicals that are most critical for downstream chemical production. In providing this comprehensive data foundation, the CRYSTAL framework offers systematic guidance for targeted engineering and policy interventions. Its transparent, modular nature is designed to shift chemical LCA from a reliance on "unknown unknowns" to a collaboratively improvable mapping of "known unknowns".
Paper Structure (6 sections, 4 figures)

This paper contains 6 sections, 4 figures.

Figures (4)

  • Figure 1: The crystal framework automatically and transparently predicts cradle-to-gate lci data for organic chemical production and identifies environmentally optimal production pathways.crystal balances efficient lci data generation with the need to assess the environmental impacts of a vast number of traded chemicals, closing critical data gaps in life cycle assessments. Based on the molecular structure of the target chemical, we 1) predict retrosynthesis pathways, 2) estimate the lci data, i.e., material demands, energy demands, waste streams, and direct emissions for each reaction step, 3) construct a crn to transfer information across reactions, and 4) optimize the crn based on the lcia scores calculated from the predicted lci data to identify environmentally optimal production pathways using user-specified environmental objectives. The framework flexibly adapts to various lcia methods and background databases.
  • Figure 2: Performance of crystal on three state-of-the-art lca databases for climate change impact. The first three columns present the validation results of the crystal framework on the ecoinvent wernet2016ecoinventfrischknecht2005ecoinventecoinvent2025 (257.0 organic chemicals (version 3.10, cut-off) assessed with ReCiPe 2016 v1.03, midpoint (H)), cm.chemicals carbonminds_databasestellner2023_carbonminds (746.0 organic chemicals based on IPCC 2021 global warming potential (GWP100)), and a literature database langhorst2023stoichiometry (143.0 organic chemicals assessed with ecoinvent, version 3.5, cut-off) based on ReCiPe v1.13, midpoint (H)), using the corresponding database as the basis and a loo approach. The "Reference Comparison" column compares the ecoinvent and cm.chemicals databases for 232.0 chemicals identified as the complete intersection of the two databases with available smiles representations, serving as a benchmark for evaluating crystal's performance. The violin plots of the first three columns summarize crystal’s overall performance on each database, indicating the percentage of climate change (unit: CO2-eq per of the target chemical) predictions falling within an acceptable accuracy range, based on predicted-to-reference value ratios. Dashed lines indicate the acceptable range defined by the aace accuracy bounds (50200 of the environmental impact defined in the corresponding state-of-the-art database). Violin plots display the distribution between the 5th and 95th percentiles to highlight this range. The parity plots of the first three columns illustrate crystal’s Pearson correlation and mare when the last reaction step of the predicted routes is the same as the one of the reference databases.
  • Figure 3: Pinpointing environmental hotspots across impact categories relative to climate change to support chemical manufacturers, regulatory authorities, and lca data providers. Top row: Climate change versus human toxicity: carcinogenic for pathways optimized for climate change impact. Chromium trioxide and anthraquinone, together with chromium-containing waste incineration, are highlighted as relative environmental hotspots in human toxicity: carcinogenic, along with their downstream chemicals (left). Regulating chromium(VI) emissions for these two hotspot reactants and for the chromium-containing waste treatment effectively reduces human toxicity: carcinogenic of all downstream chemicals (middle). Number of downstream chemicals (D/S) associated with these hotspots (right). Bottom row: Climate change versus ozone depletion for pathways optimized for (left) climate change and (middle) ozone depletion impact. Relative environmental hotspots in climate-change-optimized pathways are highlighted together with their downstream chemicals (i.e., five upstream reactants (R-hotspots) and two solvents, chloroform and dichloromethane). To emphasize changes in point density, only downstream chemicals with disproportionately high ozone depletion impacts are highlighted (middle). Number of downstream chemicals associated with these hotspots (right). ecoinvent version 3.10 (cut-off) wernet2016ecoinventfrischknecht2005ecoinventecoinvent2025 and lcia method ReCiPe 2016 v1.03, midpoint (H) are used as the underlying lca database.
  • Figure 4: Hub chemicals identified to guide prioritization of process optimization strategies for enhancing the sustainability of the chemical industry.a) A representative example illustrating the structurally important role of a hub chemical within an illustrative chemical reaction network. The green part indicates the joint-optimal reactions across all environmental impact categories. b) Ranking of 52.0 hub chemicals by the number of strongly affected downstream chemicals, measured as those whose climate change impacts decrease by more than 2% in response to a 10% reduction in the impact of each hub chemical (details described in the Supplementary Text). Chemicals available in the underlying ecoinvent database are highlighted in red. c) Switching from fossil- to bio-based ethanol pathways substantially reduces the climate change impacts for over 2000.0 strongly affected downstream products.