Skip to content

Latest commit

 

History

History
14 lines (11 loc) · 800 Bytes

README.md

File metadata and controls

14 lines (11 loc) · 800 Bytes

Language Model Risk Cards: Starter Set

A set of Language Model Risk cards for assessing a language model use case. To use these:

  • Choose what use-case, model, and interface is to be assessed
  • Select which of these risk cards is relevant in the given use-case scenario
  • Recruit people to do the assessment
  • For each risk card,
    • Design how one will probe the model, and for how long
    • Try to provoke the described behaviour from the language model, using your own prompts
    • Record all inputs and outputs
  • Compile an assessment report

Full details are given in the paper: Assessing Language Model Deployment with Risk Cards (2023), Leon Derczynski, Hannah Rose Kirk, Vidhisha Balachandran, Sachin Kumar, Yulia Tsvetkov, M. R. Leiser, Saif Mohammad