Category Archives: Immunoinformatics

Making your code pip installable

aka when to use a CutomBuildCommand or a CustomInstallCommand when building python packages with setup.py

Bioinformatics software is complicated, and often a little bit messy. Recently I found myself wading through a python package building quagmire and thought I could share something I learnt about when to use a custom build command and when to use a custom install command. I have also provided some information about how to copy executables to your package installation bin. **ChatGPT wrote the initial skeleton draft of this post, and I have corrected and edited.

Next time you need to create a pip installable package yourself, hopefully this can save you some time!

Continue reading

Conference Summary: AIRR Community Meeting VII – Learnings and Perspectives

At the start of June, we (Lewis and Benjie) attended the AIRR Community meeting in beautiful and sunny Porto, Portugal. This meeting was focused on collecting and analysing adaptive immune receptor repertoires. This comprised of two rivalling factions at the conference: the antibody (Ab) people or the T cell antigen receptor (TCR) people. The split was nearly fifty-fifty between these two topics throughout the conference. Overall, the conference was a fairly comfortable size, with approximately a hundred people in attendance, making it easy to visit all of the posters and talk with many people in your area, without feeling too niche. There was a wide variety of content formats throughout the conference including posters, scientific talks, lightning talks, software demos, and hands-on tutorials. In the following section, we highlight some of our favourite sessions to give a flavour of what this meeting entails.

Continue reading

Conference Summary: MGMS Adaptive Immune Receptors Meeting 2024

On 5th April 2024, over 60 researchers braved the train strikes and gusty weather to gather at Lady Margaret Hall in Oxford and engage in a day full of scientific talks, posters and discussions on the topic of adaptive immune receptor (AIR) analysis!

Continue reading

Useful metrics and their meanings

Short and selfish blog here. Probably been done before, but I shall carry on regardless. I am going to review some metrics relevant to our area of Immunoinformatics. In other words, I will try dissect things such as perplexity, logits, pTM, pLDDT and the ABodyBuilder2 confidence score. These numbers can help inform us on the likelihood of predictions, and whether we should have confidence in them.

Continue reading

Working with PDB Structures in Pandas

Pandas is one of my favourite data analysis tools working in Python! The data frames offer a lot of power and organization to any data analysis task. Here at OPIG we work with a lot of protein structure data coming from PDB files. In the following article I will go through an example of how I use pandas data frames to analyze PDB data.

Continue reading

The Antibody Dictionary

Similar to getting lost in a language when moving country, you might encounter a language barrier when moving research fields. This dictionary will guide you in the complex world of immunoinformatics, with a focus on antibodies. Whether your main research will be in this field, you want to apply your machine learning model on antibodies, or you just want to understand the research performed in OPIG, this dictionary will get you started.

The Antibody Dictionary:

Affinity maturation: The optimisation process of naive antibodies to memory antibodies such that the antibody is optimised for a specific antigen. 

Antibody: (immunoglobulin) a Y-shaped molecule important in the adaptive immune system. A canonical antibody consists of two identical heavy chains and two identical smaller light chains. 

Continue reading

What can you do with the OPIG Immunoinformatics Suite? v3.0

OPIG’s growing immunoinformatics team continues to develop and openly distribute a wide variety of databases and software packages for antibody/nanobody/T-cell receptor analysis. Below is a summary of all the latest updates (follows on from v1.0 and v2.0).

Continue reading

Exploring the Observed Antibody Space (OAS)

The Observed Antibody Space (OAS) [1,2] is an amazing resource for investigating observed antibodies or as a resource for training antibody specific models, however; its size (over 2.4 billion unpaired and 1.5 million paired antibody sequences as of June 2023) can make it painful to work with. Additionally, OAS is extremely information rich, having nearly 100 columns for each antibody heavy or light chain, further complicating how to handle the data. 

From spending a lot of time working with OAS, I wanted to share a few tricks and insights, which I hope will reduce the pain and increase the joy of working with OAS!

Continue reading