Project Overview

ECO - an Early-stage Carbon Impact Observer for Intelligent Sustainability Forecasting in Buildings.

Developed by Daniel Favour O. Oshidero as part of an MSc in Computer Science at The University of Bath (2023/24).

ECO aims to close the gap in sustainable architectural integrations by introducing an intelligent model capable of translating early-stage design descriptions into precise estimates of material masses and embodied carbon footprints.

Research Statement

The investigation and development of a machine learning (ML) model and end-user tool for the prediction of highly accurate carbon impact analyses from early outlines, with potential future integration into existing architectural design pipelines.

Abstract

An increasing need for carbon awareness and sustainable architecture has prompted the development of tools that can predict and mitigate the environmental impact of new constructions from the earliest stages of design.

ECO is designed to integrate early-stage impact assessments into architectural processes, providing architects with immediate feedback on the carbon implications of their design choices, fostering more sustainable design practices from the outset.

Expected Impact of Research

The integration of intelligent learning techniques with environmental impact assessment could significantly transform the conceptualisation stage of building design with respect to their carbon footprint.

By equipping designers with the ability to predict the carbon impact of their proposals early and accurately, this research allows them to make informed, proactive decisions.

Potential Applications

- Existing building carbon audits with reduction recommendations.

- Real time carbon predictions in design meetings to enhance carbon awareness.

- Design optimisation capabilities through inverse predictions to find ideal parameters for carbon goals.

- Integration with operational energy systems for carbon offset forecasting.

Tool Specifications

This tool utilizes a robust dataset of 150,000 synthetic datapoints generated from pre-existing carbon calculators. Synthetic data has been used as there currently exists too large of a gap in existing research and data to currently predict whole building carbon from that data.

These calculators used to generate carbon impact employ carbon factors derived from the Inventory of Carbon and Energy (ICE) database and Environmental Product Declarations (EPDs).

The full pipeline is as follows:

Initial data entry in text form.

Feature Extraction using NLP techniques, including SpaCy's en_core_web_trf model for Named Entity Recognition and the all-mpnet-base-v2 model from Sentence Transformers for semantic similarity.

A Histogram-based Gradient Boosting (HistGradBoost) model processes the extracted features via regression.

The model generates a final prediction based on the processed data.

The final step regression model is highly precise with an average accuracy of 95% across multiple tests.

You can download any material assumptions here.
You can download any logical constraints here.

Disclaimers

This tool is made as a Proof of Concept. It can be used by students, architects, and designers alike, but with caution. The estimates and assessments provided by ECO are intended only for preliminary analysis and should not be relied upon as the final source of information for critical decision-making. Users are encouraged to validate results with additional tools and professional expertise.

Open issues:

- Currently, the algorithm tends to favour larger buildings. Smaller buildings may be over-engineered. This can be resolved with longer descriptions.