2/18/2026 | USA | technology | ✓ Verified - arxiv.org

Enhancing Building Semantics Preservation in AI Model Training with Large Language Model Encodings

#building semantics #AI model training #architecture #engineering #construction #operations #one‑hot encoding #large language model #LLM encodings #semantic relationships #anatomy of built environments

📌 Key Takeaways

Accurate representation of building semantics is critical for AI training in the AECO industry.
Traditional one‑hot encoding fails to capture subtle relationships between related subtypes.
A novel LLM‑based encoding strategy is proposed to better preserve building semantics.
The approach targets improved AI semantic comprehension for architectural and engineering applications.
The study was disseminated as a free preprint on arXiv in February 2026.

📖 Full Retelling

Researchers have introduced a novel large‑language‑model (LLM) based encoding approach aimed at preserving building semantics more effectively during AI model training for the architecture, engineering, construction and operations (AECO) sector. The study, published as an arXiv preprint on February 17, 2026, addresses the shortcomings of traditional one‑hot encoding techniques that often miss nuanced relationships among closely related building subtypes, thereby limiting AI’s semantic understanding. By leveraging LLM encodings, the authors seek to enhance AI capabilities in comprehending both generic object types and specific subtypes within built environments.

🏷️ Themes

Artificial intelligence, Semantic preservation, Building information modeling, Large language models, AECO industry, Data encoding

Entity Intersection Graph

No entity connections available yet for this article.

Deep Analysis

}

Original Source

              arXiv:2602.15791v1 Announce Type: new 
Abstract: Accurate representation of building semantics, encompassing both generic object types and specific subtypes, is essential for effective AI model training in the architecture, engineering, construction, and operation (AECO) industry. Conventional encoding methods (e.g., one-hot) often fail to convey the nuanced relationships among closely related subtypes, limiting AI's semantic comprehension. To address this limitation, this study proposes a novel
            

Read full article at source

Source

arxiv.org

Enhancing Building Semantics Preservation in AI Model Training with Large Language Model Encodings

📌 Key Takeaways

📖 Full Retelling

🏷️ Themes

Entity Intersection Graph

Deep Analysis

Source

More from USA

News from Other Countries

🇬🇧 United Kingdom

🇺🇦 Ukraine