Interpretability

Why AI is Harder Than We Think: Key Takeaways for Business Leaders

A new study has shown that transformers can be expressed in a simple logic formalism. This finding challenges the perception that transformers are inscrutable black boxes and suggests avenues for interpreting how they work.

Ines Almeida

13.08.23 07:50 PM - Comment(s)

Transformers Expressible in Simple Logic

A new study has shown that transformers can be expressed in a simple logic formalism. This finding challenges the perception that transformers are inscrutable black boxes and suggests avenues for interpreting how they work.

Ines Almeida

13.08.23 07:50 PM - Comment(s)

DisentQA: Catching Knowledge Gaps and Avoiding Misleading Users

Building QA Systems that catch knowledge gaps and avoid misleading users.

Ines Almeida

12.08.23 09:22 AM - Comment(s)

Peeking Inside the Black Box: Uncovering What AI Models Know About Books

New research from the University of California, Berkeley sheds light on one slice of these models' knowledge: which books they have "read" and memorized. The study uncovers systematic biases in what texts AI systems know most about.