CLIP and its descendants have become a staple of text-image models. Can we do the same thing but for text-to-protein? Yes!
➡️ Xu, Yuan et al. present ProtST, a framework for learning joint representations of textual protein descriptions (via PubMedBERT) and protein sequences (via ESM). Besides the contrastive loss, ProtST features a multimodal masked prediction objective, e.g., masking 15% of the tokens in the text and the protein sequence and predicting them jointly from the latent representations, as well as masked prediction losses based on the sequence or the text alone. The authors also build a new ProtDescribe dataset of 550,000 aligned protein sequence-description pairs. ProtST excels at many protein modeling tasks in the PEER benchmark, including protein function annotation and localization, and also enables retrieval of proteins directly from a textual description without any fine-tuning (see an example below). Looks like ProtST has a bright future as the backbone of many generative protein models 😉
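To get a rough feel for the training objectives, here is a minimal PyTorch sketch (not the actual ProtST code; the encoder outputs, names, and temperature are placeholders) of a CLIP-style contrastive loss between sequence and text embeddings plus the 15% token masking used for the masked prediction objective:

```python
# Minimal sketch, assuming precomputed sequence/text embeddings from ESM and
# PubMedBERT style encoders. Not the official ProtST implementation.
import torch
import torch.nn.functional as F

def contrastive_loss(seq_emb, txt_emb, temperature=0.07):
    """InfoNCE over a batch of aligned (protein sequence, description) pairs."""
    seq_emb = F.normalize(seq_emb, dim=-1)           # [B, D]
    txt_emb = F.normalize(txt_emb, dim=-1)           # [B, D]
    logits = seq_emb @ txt_emb.t() / temperature     # [B, B] similarity matrix
    targets = torch.arange(len(logits), device=logits.device)
    # symmetric cross-entropy: match each sequence to its text and vice versa
    return 0.5 * (F.cross_entropy(logits, targets) +
                  F.cross_entropy(logits.t(), targets))

def mask_tokens(tokens, mask_id, p=0.15):
    """Mask ~15% of tokens for the (multimodal) masked prediction objective."""
    mask = torch.rand(tokens.shape, device=tokens.device) < p
    corrupted = tokens.clone()
    corrupted[mask] = mask_id
    return corrupted, mask
```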
In fact, ICML features several protein generation works like Genie by Lin and AlQuraishi and FrameDiff by Yim, Trippe, De Bortoli, Mathieu et al. These are not yet conditioned on textual descriptions, so plugging ProtST into them looks like an obvious improvement 📈.
⚛️ MPNNs on molecules have a strict locality bias that inhibits modeling of long-range interactions. Kosmala et al. derive Ewald Message Passing, applying the idea of Ewald summation, which decomposes the interaction potential into short-range and long-range terms. The short-range interactions are modeled by any GNN, while the long-range part is the new bit and is modeled with a 3D Fourier transform and message passing over Fourier frequencies. It turns out this long-range term is quite flexible and can be plugged into any network modeling periodic or aperiodic systems (like crystals or molecules), such as SchNet, DimeNet, or GemNet. The model was evaluated on the OC20 and OE62 datasets. If you are interested in more details, check out the one-hour talk by Arthur Kosmala at the LOG2 Reading Group!
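For intuition about the long-range term, here is a small NumPy sketch of the classical reciprocal-space Ewald sum that the method builds on: interactions are summed over a truncated set of Fourier frequencies instead of over real-space neighbors (the paper replaces the fixed 1/k² filter with learned frequency filters; variable names, conventions, and the cutoff below are illustrative):

```python
# Minimal sketch of the reciprocal-space (long-range) part of Ewald summation,
# not the learned message passing from the paper.
import numpy as np

def ewald_long_range_energy(positions, charges, cell, alpha=0.3, k_cutoff=5):
    """positions: [N,3] Cartesian coords, charges: [N], cell: [3,3] rows = lattice vectors."""
    recip = 2 * np.pi * np.linalg.inv(cell).T        # reciprocal lattice vectors (rows)
    volume = abs(np.linalg.det(cell))
    ints = np.arange(-k_cutoff, k_cutoff + 1)
    energy = 0.0
    for nx in ints:
        for ny in ints:
            for nz in ints:
                if nx == ny == nz == 0:
                    continue
                k = nx * recip[0] + ny * recip[1] + nz * recip[2]
                k2 = k @ k
                # structure factor S(k) = sum_i q_i * exp(i k . r_i)
                s_k = np.sum(charges * np.exp(1j * positions @ k))
                # Gaussian damping splits short- and long-range contributions
                energy += np.exp(-k2 / (4 * alpha**2)) / k2 * abs(s_k) ** 2
    return 2 * np.pi / volume * energy
```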
A similar idea of using Ewald summation for 3D crystals is used in PotNet by Lin et al. where the long-range connection is modeled with incomplete Bessel functions. PotNet has been evaluated on the Materials Project dataset and JARVIS — so by reading these two papers you can get a good understanding of the benefits of Ewald summation for many crystal-related tasks 🙂
➡️ Another take on imbuing any GNN with equivariance for crystals and molecules is given by Duval, Schmidt et al. in FAENet. A standard approach is to bake certain symmetries and equivariances right into the GNN architecture (as in EGNN, GemNet, and Ewald Message Passing); this is a safe but computationally expensive route (especially when dealing with spherical harmonics and tensor products). Another option, often used in vision, is to show the model many augmentations of the same input so that it eventually learns invariance to those augmentations. The authors take the second route and design a rigorous way to sample invariant or equivariant augmentations of 2D/3D data (e.g., for energies or forces, respectively), all with fancy proofs ✍️. To this end, the data augmentation pipeline projects the 2D/3D inputs onto a canonical representation (based on a PCA of the distance covariance matrix) from which rotations can be sampled uniformly.
The proposed FAENet is a simple model that only uses distances but shows very good performance with stochastic frame averaging as the data augmentation, while being 6 to 20 times faster. It works for crystal structures, too!
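Here is a rough sketch of what stochastic frame averaging can look like (my own simplification, not the FAENet implementation: it uses a PCA of the covariance of centered atom positions and samples one proper rotation per forward pass):

```python
# Minimal sketch of stochastic frame averaging: build a canonical frame via PCA,
# then randomly sample one of the sign-ambiguous proper rotations.
import torch

def sample_frame(pos):
    """pos: [N, 3] atom coordinates. Returns coordinates in a randomly sampled PCA frame."""
    centered = pos - pos.mean(dim=0, keepdim=True)
    cov = centered.t() @ centered / pos.shape[0]          # 3x3 covariance of positions
    _, eigvecs = torch.linalg.eigh(cov)                   # columns = principal axes
    # each eigenvector's sign is arbitrary -> up to 8 frames; keep the 4 proper rotations
    frames = []
    for sx in (1, -1):
        for sy in (1, -1):
            for sz in (1, -1):
                R = eigvecs * torch.tensor([sx, sy, sz], dtype=pos.dtype)
                if torch.det(R) > 0:                       # det = +1, i.e. a rotation
                    frames.append(R)
    R = frames[torch.randint(len(frames), (1,)).item()]   # stochastic: pick one frame
    return centered @ R                                    # coordinates in the sampled frame
```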