publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2023

  1. Detecting Edit Failures In Large Language Models: An Improved Specificity Benchmark
    Fazl Barez*, Julia Persson*, Ioannis Konstas, Esben Kran, and 1 more author
    2023
  2. The Larger they are, the Harder they Fail: Language Models do not Recognize Identifier Swaps in Python
    Antonio Valerio Miceli Barone*, Fazl Barez*, Ioannis Konstas, and 1 more author
    2023
  3. Neuron to Graph: Interpreting Language Model Neurons at Scale
    Alex Foote*, Neel Nanda, Fazl Barez*, and 3 more authors
    2023
  4. Understanding Addition in Transformers
    Philip Quirke and Fazl Barez
    2023
  5. Interpreting Reward Models in RLHF-Tuned Language Models Using Sparse Autoencoders
    Luke Marks, Amir Abdullah, Luna Mendez, and 3 more authors
    2023
  6. AI Systems of Concern
    Kayla Matteucci, Shahar Avin, Fazl Barez, and 1 more author
    2023
  7. DeepDecipher: Accessing and Investigating Neuron Activation in Large Language Models
    Fazl Barez and Albert Garde
    2023
  8. The Alan Turing Institute’s response to the House of Lords Large Language Models Call for Evidence
    Fazl Barez, Philip H. S. Torr, Aleksandar Petrov, and 24 more authors
    2023
  9. Fairness in AI and Its Long-Term Implications on Society
    Ondrej Bohdal*, Timothy Hospedales, Philip H. S. Torr, and 1 more author
    2023
  10. Exploring the Advantages of Transformers for High-Frequency Trading
    Fazl Barez, Paul Bilokon, Arthur Gervais, and 1 more author
    2023
  11. Benchmarking Specialized Databases for High-frequency Data
    Fazl Barez, Paul Bilokon, and Ruijie Xiong
    2023
  12. Identifying a Preliminary Circuit for Predicting Gendered Pronouns in GPT-2 Small
    Chris Mathwin, Guillaume Corlouer, Esben Kran, and 2 more authors
    2023

2022

  1. PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration
    Pengyi Li, Hongyao Tang, Tianpei Yang, and 7 more authors
    2022
  2. System III: Learning with Domain Knowledge for Safety Constraints
    Fazl Barez, Hosein Hasanbeig, and Alessandro Abate
    2022

2021

  1. ED2: An Environment Dynamics Decomposition Framework for World Model Construction
    Cong Wang, Tianpei Yang, Fazl Barez, and 7 more authors
    2021
  2. Discovering topics and trends in the UK Government web archive
    David Beavan, Fazl Barez, M Bel, and 4 more authors
    2021