Skip to content

Rostlab/pbc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Protein (language model) Benchmarking Collection - PBC

This repository contains well-established datasets for interpretable and reliable protein language model (pLM) benchmarking.

Datasets

All included datasets are listed below. Details and files can be found in the respective folders.

Supervised

Experimental

The following experimental datasets can be found on a separate branch. They are not part of the official release.

  • (Supervised) binding
    • Known limitation: Dataset size
  • (Supervised) membrane
    • Known limitation: Data imbalance

Benchmarking

If you want to benchmark a new or existing pLM on these datasets, please check out one of the following methods: