Two former OpenAI employees have built a website called "In the Weights" that reveals which people AI models can recall pure…
A new website, "In the Weights," allows users to query large language models and determine if and how strongly they are represented in the AI's training data, quantified by a "strength score." This offers a novel lens into the proprietary datasets powering models like OpenAI's GPT-4 and Google's PaLM 2, moving beyond theoretical discussions of data provenance to empirical, user-driven exploration.
The implications extend to privacy, intellectual property, and the very definition of "knowledge" for these systems. For individuals, it raises questions about personal data usage and potential biases embedded within models. For developers, it provides a tool to probe the composition and potential blind spots of their own or competitors' models, hinting at the hidden biases and biases within the datasets used.
Future developments will likely focus on expanding the database of queryable individuals and AI models, and refining the "strength score" methodology. Crucially, understanding the correlation between this score and model behavior, such as factual recall or generation of personally identifiable information, will be key to assessing the practical impact of this newfound transparency.