The Geometry of Concepts: Sparse Autoencoder Feature Structure

Yuxiao LiEric J. MichaudDavid D. BaekJoshua EngelsXiaoqing SunMax Tegmark

Sparse autoencoders have recently produced dictionaries of high-dimensional vectors corresponding to the universe of concepts represented by large language models. We find that this concept universe has interesting structure at three levels: 1) The “atomic” small-scale structure contains “crystals” whose faces are parallelograms or trapezoids, generalizing well-known examples such as (man-woman-king-queen). We find that the quality of such parallelograms and associated function vectors improves greatly when projecting out global distractor directions such as word length, which is efficiently done with linear discriminant analysis. 2) The “brain” intermediate-scale structure has significant spatial modularity; for example, math and code features form a “lobe” akin to functional lobes seen in neural fMRI images. We quantify the spatial locality of these lobes with multiple metrics and find that clusters of co-occurring features, at coarse enough scale, also cluster together spatially far more than one would expect if feature geometry were random. 3) The “galaxy” scale large-scale structure of the feature point cloud is not isotropic, but instead has a power law of eigenvalues with steepest slope in middle layers. We also quantify how the clustering entropy depends on the layer.

View PDF

  • Related Posts

    The Rise of Efficient AI Models: TinySwallow and Beyond

    In the ever-evolving landscape of artificial intelligence, there’s a significant shift happening. Companies are beginning to realize that bigger doesn’t always mean better. Instead, the focus is now on creating smaller, more efficient AI models that can deliver high performance…

    DeepSeek: A China-Based LLM with Global Implications

    1. Overview of DeepSeek DeepSeek is a large-scale language model developed by a Chinese tech company, optimized mainly for processing the Chinese language. Its name suggests capabilities in both deep learning (“Deep”) and search/analysis (“Seek”). Based on available information and…

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    The Rise of Efficient AI Models: TinySwallow and Beyond

    The Rise of Efficient AI Models: TinySwallow and Beyond

    Philosophical and Historical Considerations on AI and Basic Income

    Philosophical and Historical Considerations on AI and Basic Income

    Understanding the AI Bubble: The DeepSeek Shock and Its Implications

    Understanding the AI Bubble: The DeepSeek Shock and Its Implications

    The DeepSeek Shock: How a Chinese AI Startup Disrupted the U.S. Stock Market

    The DeepSeek Shock: How a Chinese AI Startup Disrupted the U.S. Stock Market

    Neuromorphic Computing: Can It Play a Role in Mainstream AI Development?

    Neuromorphic Computing: Can It Play a Role in Mainstream AI Development?

    The AI Arms Race: Insights from Scale AI CEO Alexandr Wang

    The AI Arms Race: Insights from Scale AI CEO Alexandr Wang