about

cameron-venti-3rabTGLccwc-unsplash.jpg

There is strong scientific evidence on the adverse effects of climate change on the global ocean. These changes will have a drastic impact on almost all life forms in the oceans with further consequences on food security, the ecosystems in coastal and inland communities. Despite these impacts, scientific data and infrastructures are still lacking to understand better and quantify the consequence of these perturbations on the marine ecosystem. It is necessary not only to gather more data but also to develop and apply state-of-the-art mechanisms capable of turning this data into effective knowledge, policies, and action. This is where artificial intelligence, machine learning, and modeling tools are called for.

This Inria Challenge OcéanIA aims at developing new artificial intelligence and mathematical modeling tools to contribute to the understanding of the structure, functioning, underlying mechanisms, and dynamics of the oceans and their role in regulating and sustaining the biosphere, and tackling the climate change. OcéanIA is then an opportunity to structure Inria’s contributions around a global scientific challenge in the convergence of Artificial Intelligence, Biodiversity & Climate Change.

OcéanIA is a four-years project (11.2020–10.2024) involving Inria teams in Chile, Paris, Saclay, and Sophia-Antipolis, and the Fondation Tara Océan, the Center of Mathematical Modeling (CMM, U.Chile), the Pontificia Universidad Católica de Chile (PUC), the GO-SEE CNRS Federation, and the Laboratoire des Sciences du Numérique de Nantes (LS2N). See the full description of the team here.

                        

goals

The goals of the project are structured in two directions. One that gathers the work from computer science and applied math to meet the challenges of the problem. The other focuses on applying the results of the first in multi-disciplinary application contexts.

The goals are discussed in depth in the project white book:

  • Sanchez-Pi, N., Martí, L., Abreu, A., Bernard, O., de Vargas, C., Eveillard, D., Maass, A., Marquet, P. A., Sainte-Marie, J., Salomon, J., Schoenauer, M., & Sebag, M. (2021). OcéanIA: AI, Data, and Models for Understanding the Ocean and Climate Change. (N. Sanchez-Pi & L. Martí, Eds.). Lille, Paris, Saclay, Santiago, Sophia-Antipolis: Inria – Institut national de recherche en sciences et technologies du numérique. download pdf

Computer science and math objectives

Data governance, curation, and availability Data governance policy for marine biology and oceanographic data, consolidated access to data, and scientific computing software stacks.
Structured and graph-based neural networks Model depth, models scalability, graph topological heterogeneity, and dynamic graphs.
Learning and adaptation in small data contexts Transfer learning and domain adaptation, active and few-shot learning, and multi-source and multi-task learning deep neural models.
Causality and explainable models in AI Causal inference, explainable AI and adversarial machine learning, interpretable shadow models and causal inference for understanding internal representations.
Model-driven and data-driven integration and hybrids Learning PDEs from data, understanding neural network learning dynamics, and hybrid models combining PDE solvers and neural networks.
Development, calibration, and validation of mechanistic models Identifiability issues, metabolic model reduction, and Navier-Stokes equations: From Eulerian to Lagrangian.

Multi-disciplinary applied objectives

Biodiversity and ecosystem functioning Meta-metabolic modeling, phytoplankton biodiversity concerning temperature, present, and future, Data assimilation in biogeochemical models: Predicting the future.
Understanding plankton communities using AI, ML, and vision Plankton identification from satellite images, connecting images and genomic features, anomaly detection, and explainable AI for automatic plankton discovery.


Interrelation of the objectives.

team

The OcéanIA team has diverse combination of skills, experience, and interests, something that is necessary to address a research-intensive and multi-disciplinary project such as this one.

Scientific Committee

Nayat Sánchez-Pi

Lead

Inria Chile Research Center

Artificial Intelligence

Luis Martí

Co-lead

Inria Chile Research Center

Machine learning

Julien Salomon

Team coordinator

ANGE project team - Inria Paris

Applied mathematics

Jacques Sainte-Marie

Team coordinator

ANGE project team - Inria Paris

Senior scientist

Olivier Bernard

Team coordinator

BIOCORE project team - Inria Sophia-Antipolis

Modelling, optimisation and monitoring of artificial ecosystems

Michèle Sebag

Team coordinator

TAU project team - Inria Saclay

Machine learning

Marc Schoenauer

Team coordinator

TAU project team - Inria Saclay

Machine learning

Alejandro Maass

Team coordinator

Center of Mathematical Modeling (CMM) - University of Chile

Ergodic theory and systems biology

Damien Eveillard

Team coordinator

ComBi - Nantes University

System biologiy and oceanography

André Abreu

Team coordinator

Fondation Tara Océans

International relations and economic development

Colomban de Vargas

Team coordinator

GO-SEE CNRS Federation

Marine biologist

Pablo Marquet

Team coordinator

Pontificia Universidad Católica de Chile

Macroecology

Researchers

Francesca Casagli

Resercher

BIOCORE project team - Inria Sophia-Antipolis

Modelling of artificial ecosystems

Louis Thiry

Postdoc

ANGE project team - Inria Paris

Stochastic modeling of ocean dynamics

Léon Mingus

PhD student

ANGE project team - Inria Paris

Romain Ranini

PhD student

BIOCORE project team - Inria Sophia-Antipolis

Modelling, Deep learning, Climate change, Primary production, Marine carbon pump

Ignacio Fierro

PhD student

BIOCORE project team - Inria Sophia-Antipolis

Luis Valenzuela

Postdoc

Inria Chile Research Center

Bioinformatics, genomics and machine learning

Hernan Lira

R&D Engineer

Inria Chile Research Center

Machine learning

Francisco Altimiras

Inria Chile Research Center


Research Engineers

Natalia Vidal

Inria Chile Research Center

Mia Elbo

Inria Chile Research Center

Astrid Reyes

Inria Chile Research Center

Sofía Callejas

Inria Chile Research Center


Former members

Hugo Carrillo

Postdoc

Pontificia Universidad Católica de Chile

Inverse problems, numerical analysis, mathematical modeling and simulation

Dante Travisany

Postdoc

Universidad de las Américas

Complex systems engineering

Ana Muñoz

Postdoc

Universidad Técnica Federico Santa María

Ontologies, Knowledge Management and Databases

Andrew Berry

R&D Engineer

Machine Learning, Computer Vision

Andrés Vignaga

Engineering team coordinator

Inria Chile Research Center

Software engineering and architecture

Publications and events

Diatom_chain.jpg

Upcoming workshops and events

  • Workshop AIMOCC 2024 at ECAI 2024 – AI: Modeling Oceans and Climate Change Workshop at ECAI 2024. Sánchez-Pi, N. and Martí, L. (eds). October 16-24, 2024. more information

Past meetings and workshops

  • OcéanIA 2023 annual meeting. February 23, 2024. more information
  • OcéanIA 2022 annual meeting. December, 2022. more information
  • Workshop AIMOCC 2022 at IJCAI 2022 – AI: Modeling Oceans and Climate Change Workshop at IJCAI 2022. Sánchez-Pi, N. and Martí, L. (eds). July 23-29, 2022. online proceedings
  • OcéanIA Challenge at IJCAI 2022: AI methods for determining ocean ecosystems from space: Combining genomic information, microscopic and satellite imagery. Sánchez-Pi, N. and Martí, L. (eds). July 23-29, 2022. more information
  • Workshop AIMOCC 2021 at ICLR 2021 – AI: Modeling Oceans and Climate Change Workshop at ICLR 2021. Sánchez-Pi, N. and Martí, L. (eds). May 7, 2021. online proceedings

Software

Click here to see the software being developed within the project.

Publications


Journal Articles

  1. Carrillo, H., de Wolff, T., Martí, L., & Sanchez-Pi, N. (2023). Evolutionary multi-objective physics-informed neural networks: The MOPINNs approach. AI Communications, 1–13. doi: 10.3233/aic-230073 bibtex
  2. Arroyo, J. I., Dı́ez Beatriz, Kempes, C. P., West, G. B., & Marquet, P. A. (2022). A general theory for temperature dependence in biology. Proceedings of the National Academy of Sciences of the United States of America, 119(30), e2119872119. doi: 10.1073/pnas.2119872119 abstract bibtex
  3. Aguayo, J., & Lincopi, H. C. (2022). Analysis of Obstacles Immersed in Viscous Fluids Using Brinkman’s Law for Steady Stokes and Navier-Stokes Equations. SIAM Journal Applied Mathematics, 82(4), 1369–1386. doi: 10.1137/20M138569X abstract bibtex
  4. Lira, H., Martí, L., & Sanchez-Pi, N. (2022). A Graph Neural Network with Spatio-Temporal Attention for Multi-Sources Time Series Data: An Application to Frost Forecast. Sensors, 22(4). doi: 10.3390/s22041486 hal: hal-03541565 abstract bibtex
  5. Demory, D., Weitz, J. S., Baudoux, A.-C., Touzeau, S., Simon, N., Rabouille, S., Sciandra, A., & Bernard, O. (2021). A thermal trade-off between viral production and degradation drives virus-phytoplankton population dynamics. Ecology Letters, 24(6), 1133–1144. doi: 10.1111/ele.13722 bibtex

Books

  1. The OcénIA Project White Book:  Sanchez-Pi, N., Martí, L., Abreu, A., Bernard, O., de Vargas, C., Eveillard, D., Maass, A., Marquet, P. A., Sainte-Marie, J., Salomon, J., Schoenauer, M., & Sebag, M. (2021). OcéanIA: AI, Data, and Models for Understanding the Ocean and Climate Change. (N. Sanchez-Pi & L. Martí, Eds.). Lille, Paris, Saclay, Santiago, Sophia-Antipolis: Inria – Institut national de recherche en sciences et technologies du numérique. hal: hal-03274323 pdf bibtex

Conference papers

  1. Vasconcellos, E. C., Sampaio, R. M., Araújo, A. P. D., Gonzales Clua, E. W., Preux, P., Guerra, R., Gonçalves, L. M. G., Martí, L., Lira, H., & Sanchez-Pi, N. (2023). Reinforcement-learning robotic sailboats: Simulator and preliminary results. In NeurIPS 2023 Workshop on Robot Learning Workshop: Pretraining, Fine-Tuning, and Generalization with Large Scale Models. New Orleans, United States. hal: hal-04395990 pdf bibtex
  2. Ferreira de Moraes, R., Evangelista, R. dos S., Pereira, A. L. da S., Toledo, Y. P., Fernandes, L. A. F., & Martí, L. (2023). Heuristics to reduce linear combinations of activation functions to improve image classification. In 2023 36th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) (pp. 169–174). IEEE. doi: 10.1109/SIBGRAPI59091.2023.10347043 hal: hal-04396390 abstract bibtex
  3. Fierro Ulloa, J. I., & Bernard, O. (2023). Physic-informed neural networks for microalgae modeling. In ECCE 14 & ECAB 7 - 14th European Congress of Chemical Engineering and 7th European Congress of Applied Biotechnology. Berlin, Germany. hal: hal-04390804 bibtex
  4. de Wolff, T., Carrillo Lincopi, H., Sanchez-Pi, N., & Martí, L. (2022). MOPINNs: An Evolutionary Multi-Objective Approach to Physics-Informed Neural Networks. In Proceedings of The Genetic and Evolutionary Computation Conference 2022 (GECCO ’22). New York, NY, USA: Association for Computing Machinery. doi: 10.1145/3520304.3529071 bibtex
  5. Carrillo, H., de Wolff, T., Martí, L., & Sanchez-Pi, N. (2022). Towards Optimally Weighted Physics-Informed Neural Networks in Ocean Modelling. In N. Sanchez-Pi & L. Martí (Eds.), Workshop AI: Modeling Oceans and Climate Change (AIMOCC 2022) of the 31st International Joint Conference on Artificial Intelligence and the 25th European Conference on Artificial Intelligence (IJCAI-ECAI 2022). bibtex
  6. de Wolff, T., Carrillo, H., Martí, L., & Sanchez-Pi, N. (2022). Optimal Architecture Discovery for Physics-Informed Neural Networks. In A. C. Bicharra Garcia, M. Ferro, & J. C. Rodríguez Ribón (Eds.), Advances in Artificial Intelligence — IBERAMIA 2022 (pp. 77–88). Cham: Springer International Publishing. doi: 10.1007/978-3-031-22419-5_7 abstract bibtex
  7. Ferreira de Moraes, R., Evangelista, R. dos S., Fernandes, L. A. F., & Martí, L. (2021). GCOOD: A Generic Coupled Out-of-Distribution Detector for Robust Classification. In 2021 34th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) (pp. 409–416). doi: 10.1109/SIBGRAPI54419.2021.00062 hal: hal-04396403 abstract bibtex
  8. Lira, H., Martí, L., & Sanchez-Pi, N. (2021). Frost forecasting model using graph neural networks with spatio-temporal attention. In N. Sanchez-Pi & L. Martí (Eds.), AI: Modeling Oceans and Climate Change Workshop at ICLR 2021. Santiago, Chile. hal: hal-03259658 pdf bibtex
  9. de Wolff, T., Carrillo, H., Martí, L., & Sanchez-Pi, N. (2021). Assessing Physics Informed Neural Networks in Ocean Modelling and Climate Change Applications. In N. Sanchez-Pi & L. Martí (Eds.), AI: Modeling Oceans and Climate Change Workshop at ICLR 2021. Santiago (Virtual), Chile. hal: hal-03262684 pdf bibtex
  10. Muñoz, A., Martí, L., & Sanchez-Pi, N. (2021). Data Governance, a Knowledge Model Through Ontologies. In R. Valencia-García, M. Bucaram-Leverone, J. Del Cioppo-Morstadt, N. Vera-Lucio, & E. Jácome-Murillo (Eds.), Technologies and Innovation (pp. 18–32). Cham: Springer International Publishing. doi: 10.1007/978-3-030-88262-4_2 abstract bibtex
  11. Sanchez-Pi, N., Martí, L., Abreu, A., Bernard, O., de Vargas, C., Eveillard, D., Maass, A., Marquet, P. A., Sainte-Marie, J., Salomon, J., Schoenauer, M., & Sebag, M. (2020). Artificial Intelligence, Machine Learning and Modeling for Understanding the Oceans and Climate Change. In D. Dao, E. Sherwin, P. Donti, L. Kaack, L. Kuntz, Y. Yusuf, D. Rolnick, C. Nakalembe, C. Monteleoni, & Y. Bengio (Eds.), Tackling Climate Change with Machine Learning workshop at NeurIPS 2020. hal: hal-03138712 pdf slides abstract bibtex

Keynotes and talks

  1. Fierro Ulloa, J. I. (2023). NeuralODEs for phytoplankton modeling. In Journées scientifiques Inria Chile 2023. bibtex
  2. Keynote:  Martí, L. (2023). Explainable AI for understanding plankton communities. In Journées scientifiques Inria Chile 2023. bibtex
  3. Keynote:  Martí, L. (2023). AI for (climate) good. In Escuela de Verano en Inteligencia Computacional. Facultad de Ciencias Físicas y Matemáticas, Universidad de Chile. bibtex
  4. Valenzuela, L. (2023). Modeling global plankton communities via multinomics and ML approaches. In Journées scientifiques Inria Chile 2023. bibtex
  5. Sanchez-Pi, N., & Martí, L. (2023). Opportunities in Chile: from the origin of the universe to the depths of the ocean. In Journées scientifiques Inria Chile 2023. bibtex
  6. Carrillo, H. (2023). A Reduced Basis (RB) approach combined with Artificial Neural Networks (ANN) for optimal location of measurements for inverse problems in PDEs. In numhyp23: Numerical Methods for Hyperbolic Problems. bibtex
  7. Keynote:  Sanchez-Pi, N. (2023). AI, the Ocean and Climate Change. In KHIPU: Latin American Meeting on Artificial Intelligence. bibtex
  8. Invited tutorial:  Sanchez-Pi, N., & Martí, L. (2021). Towards a Green AI: Evolutionary Solutions for an Ecologically Viable Artificial Intelligence. In Proceedings of the Genetic and Evolutionary Computation Conference Companion (pp. 1135–1140). New York, NY, USA: Association for Computing Machinery. doi: 10.1145/3449726.3461428 bibtex
  9. Keynote:  Sanchez-Pi, N. (2020). OcéanIA: AI, Oceans and Climate Change. In A. Ruiz-Garcia, I. Arraut, J. M. Banda, I. Lopez-Francos, F. Latorre, P. Magalhães Braga, K. Caballero Barajas, S. H. Garrido Mejia, E. U. Moya-Sánchez, V. Fernandes Caridá, C. Miranda, G. Bejarano, & J. Ortega Caro (Eds.), LatinX in AI (LXAI) workshop at NeurIPS 2020. slides view online bibtex

Theses

  1. Migus, L. (2023, December). Deep neural networks and partial differential equations (phdthesis). Sorbonne Université. hal: tel-04336969 pdf bibtex
  2. Ferreira de Moraes, R. (2023). Novas Abordagens para Função de Perda e Função de Ativação para Classificação de Imagens (phdthesis). Universidade Federal Fluminense. bibtex

Other activities

  1. Martí, L. (2023, October). AI for (climate) good. Seminar Series of the Institute of Astrophysics Studies, Universidad Diego Portales. Institute of Astrophysics Studies, Universidad Diego Portales. bibtex
  2. Valenzuela, L. (2023, September). Inteligencia Artificial y Genómica para comprender los océanos y el cambio climático. F\hatete de la Science 2023 Chile. bibtex
  3. Project presentation to students:  Valenzuela, L. (2023). OcéanIA. Santiago, Chile: Universidad Austral de Chile. bibtex

proceedings

  1. Sanchez-Pi, N., & Martí, L. (Eds.). (2022). AI: Modeling Oceans and Climate Change Workshop (AIMOCC 2022). Vienna, Austria: 31st International Joint Conference on Artificial Intelligence and the 25th European Conference on Artificial Intelligence (IJCAI-ECAI 2022). bibtex
  2. Sanchez-Pi, N., & Martí, L. (Eds.). (2021). AI: Modeling Oceans and Climate Change Workshop (AIMOCC 2021). Santiago de Chile (Virtual): Tenth International Conference on Learning Representations (ICLR 2021). bibtex

Online preprints

  1. de Wolff, T., Carrillo, H., Martí, L., & Sanchez-Pi, N. (2021, May). Towards Optimally Weighted Physics-Informed Neural Networks in Ocean Modelling. hal: hal-03260357 pdf bibtex

AIMOCC at ICLR 2021

AI: Modeling Oceans and Climate Change

An ICLR 2021 Workshop

It is our distinct pleasure to invite you to the AI: Modeling Oceans and Climate Change (AIMOCC 2021) Workshop to be held in conjunction with the Ninth International Conference on Learning Representations (ICLR 2021) and hosted in virtual-only mode.

The Anthropocene has brought along a drastic impact on almost all life forms on the planet. Considering the importance and amount of water in this speck of dust in the middle of nowhere that we inhabit, we should have called it Planet Ocean. Oceans are not only important because of their volume but are also about the functions and contributions they provide to biodiversity, the human species included.

The goal of this workshop is to bring together researchers that are interested and/or applying AI and ML techniques to problems related to marine biology, modeling, and climate change mitigation. We also expect to attract natural science researchers interested in learning about and applying modern AI and ML methods. Consequently, the workshop will be a first stone on building a multi-disciplinary community behind this research topic, with collaborating researchers that share problems, insights, code, data, benchmarks, training pipelines, etc. Together, we aim to ultimately address an urgent matter regarding the future of humankind, nature, and our planet.

Workshop programme

The workshop will take place on Friday, 7 May 2021. A zoom link will be shared to allow the participation anyone insterested.

  • Papers and presentations will be made available ASAP. Please note that programme times are in the CLT/EST (UTC-5h)
    • PST (UTC-8h): start at 06:00 (-3h).
    • CET (UTC+1h): start at 15:00 (+6h).
    • NST (UTC+8h): start at 23:00 (+14h).

Detailed programme (CLT/EST timezone)

  • 09:00 - 09:05. Opening comments and welcome by the organizers.

  • 09:05 - 09:45. Keynote presentation: Jacques Sainte-Marie, ANGE Team (Inria Paris and Sorbonne Université).

  • 09:45 - 10:05. Investigating Ground-level Ozone Formation: A Case Study in Taiwan. Yu-Wen Chen (Academia Sinica), Sourav Medya (Northwestern University), and Yi-Chun Chen (Academia Sinica). abstract paper (pdf)
  • 10:05 - 10:25. Model Discovery in the Sparse Sampling Regime. Gert-Jan Both, Georges Tod, and Remy Kusters (CRI). abstract paper (pdf)
  • 10:25 - 10:45. Physically-Consistent Generative Adversarial Networks for Coastal Flood Visualization. Björn Lütjens (MIT), Brandon Leshchinskiy (MIT), Christian Requena-Mesa (Computer Vision Group, Friedrich Schiller University Jena; DLR Institute of Data Science, Jena; Max Planck Institute for Biogeochemistry, Jena), Farrukh Chishtie (Spatial Informatics Group), Natalia Diaz Rodriguez (ENSTA Paris and INRIA Flowers), Oceane Boulais (NOAA), Aruna Sankaranarayanan (MIT), Aaron Piña (NASA Headquarters), Yarin Gal (University of Oxford), Chedy Raissi (INRIA), Alexander Lavin (Institute for Simulation Intelligence), and Dava Newman (MIT). abstract paper (pdf)

  • 10:45 - 11:05. Coffee break and short paper discussions.
  • Short papers:
    • PCE-PINNs: Physics-Informed Neural Networks for Uncertainty Propagation in Ocean Modeling. Björn Lütjens (MIT), Mark Veillette (MIT Lincoln Laboratory), Dava Newman (MIT), and Cait Crawford (IBM). abstract
    • Generative modeling of spatio-temporal weather patterns with extreme event conditioning. Konstantin Klemmer (University of Warwick), Sudipan Saha (Technical University of Munich), Matthias Kahl (Technical University of Munich), Tianlin Xu (London School of Economics and Political Science), and Xiaoxiang Zhu (Technical University of Munich). abstract paper (pdf)
    • CropGym: A reinforcement learning environment for crop management. Hiske Overweg, Herman Berghuijs, and Ioannis N. Athanasiadis (Wageningen University and Research). abstract paper (pdf)
    • Frost Forecasting Model using Graph Neural Networks with Spatio-Temporal Attention Hernán Lira, Luis Martí, and Nayat Sanchez-Pi (Inria Chile Research Center). abstract paper (pdf)
  • 11:05 - 11:45. Keynote presentation: Daniele Iudicone (Stazione Zoologica Anton Dohrn).

  • 11:45 - 12:05. Feature Importance in a Deep Learning Climate Emulator. Wei Xu (Brookhaven National Laboratory), Xihaier Luo (Brookhaven National Laboratory), Yihui (Ray) Ren (Brookhaven National Laboratory), Ji Hwan Park (Brookhaven National Laboratory), Shinjae Yoo (Brookhaven National Laboratory), and Balu Nadiga (Los Alamos National Lab). abstract paper (pdf)
  • 12:05 - 12:25. Assessing Physics Informed Neural Networks in Ocean Modelling and Climate Change Applications. Taco de Wolff, Hugo Carrillo Lincopi, Luis Martí, and Nayat Sanchez-Pi (Inria Chile Research Center). abstract paper (pdf)
  • 12:25 - 12:45. Deep Embedded Clustering for BioAcoustic Clustering of Marine Mammal Vocalization. Ali Jahangirnezhad (University of Washington Bothell) and Afra Mashhadi (University of Washington). abstract paper (pdf)

  • 12:45 - 13:20. Keynote presentation: Michèle Sebag, TAU Team (LISN, Inria, CNRS, and Univ. Paris Saclay).

  • 13:20 - 13:30. Final remarks and open discussion.

Submissions

We welcome submissions of long (8 pages) full papers and short (4 pages) summary papers. To prepare your submission, please use the ICLR 2021 LaTeX style files provided at: https://github.com/ICLR/Master-Template. Use the following link to submit your proposal(s): AIMOCC 2021 CMT submission site.

Important dates

  • [Updated] Submission deadline : April 12, 2021 (UTC-12).
  • Notification of acceptance: April 19, 2021.
  • [Updated] Reception of final version: May 5, 2021.

Topics

Topics of interest of this workshop can be grouped into two sets:

  1. Addressing and advancing the state of the art in areas like AI, ML, mathematical modeling and simulation. Here the focus is set on:
  • improving neural network handling of graph-structured information,
  • improving the capacity of ML methods to learn in small data contexts,
  • understanding causal relations, interpretability and explainability in AI,
  • integrating model-driven and data-driven approaches, and
  • to develop, calibrate, and validate existing mechanistic models.
  1. Focus on answering the questions from the application domain, where the main questions to be addressed are:
  • Which are the major patterns in plankton taxa and functional diversity?
  • Which are the major drivers of patterns and how do they interact?
  • How these patterns and drivers will likely change under climate change?
  • How will these changes affect the capacity of ocean ecosystems to sequester carbon from the atmosphere, that is the biological carbon pump?
  • What relations bind communities and local conditions?
  • What are the links between biodiversity functioning and structure?
  • How modern AI and computer vision can be applied as research and discovery support tools to understand planktonic communities?
  • How new biological knowledge can be derived from the application of anomaly detection, causal learning, and explainable AI.

Organizers

  • Nayat Sánchez-Pi and Luis Martí, Inria Chile.

Scientific committee

  • José Manuel Molina, Universidad Carlos III de Madrid,
  • Julien Salomon and Jacques Sainte-Marie, Inria Paris,
  • Olivier Bernard, Inria Sophia-Antipolis,
  • Michèle Sebag and Marc Schoenauer, Inria Saclay,
  • Alejandro Maass, Center of Mathematical Modeling (CMM), Universidad de Chile,
  • Pablo Marquet, Pontificia Universidad Católica de Chile (PUC),
  • André Abreu, Fondation TARA Océan,
  • Ana Cristina Garcia, Unirio - Federal University of Rio de Janeiro State,
  • Hernán Lira, Inria Chile Research Center,
  • Hugo Carrillo Lincopi, Inria Chile Research Center,
  • Leandro Fernandes, Universidade Federal Fluminense,
  • Roberto Santana, University of the Basque Country (UPV/EHU),
  • Colomban De Vargas, GO-SEE CNRS Federation, and
  • Damien Eveillard, ComBi, Université de Nantes.

Diversity commitment

We will seek diversity in all aspects, both in school of thought, nationalities, stages in the academic career, etc.

Access

We will publish the accepted papers and talk abstracts (before the event) and the slides of the speakers (after the event) on the workshop website. We will include a bibliography of most relevant research papers to facilitate cross pollination of ideas between these fields. Similarly, we will record the workshop and publish it online.

AIMOCC at IJCAI 2022

AI: Modeling Oceans and Climate Change 2022

An IJCAI-ECAI 2022 Workshop

It is our distinct pleasure to invite you to the Workshop AI: Modeling Oceans and Climate Change (AIMOCC 2022) to be held in conjunction with the 31st International Joint Conference on Artificial Intelligence and the 25th European Conference on Artificial Intelligence (IJCAI-ECAI 2022) on July 23-29, 2022, in Messe Wien, Vienna, Austria.

The Anthropocene has brought along a drastic impact on almost all life forms on the planet. Considering the importance and amount of water in this speck of dust in the middle of nowhere that we inhabit, we should have called it Planet Ocean. Oceans are not only important because of their volume but are also about the functions and contributions they provide to biodiversity, the human species included.

The goal of this workshop is to bring together researchers that are interested and/or applying AI and ML techniques to problems related to marine biology, modeling, and climate change mitigation. We also expect to attract natural science researchers interested in learning about and applying modern AI and ML methods. Consequently, the workshop will be a first stone on building a multi-disciplinary community behind this research topic, with collaborating researchers that share problems, insights, code, data, benchmarks, training pipelines, etc. Together, we aim to ultimately address an urgent matter regarding the future of humankind, nature, and our planet.

This workshop has a related IJCAI-ECAI 2022 Challenge: AI methods for determining ocean ecosystems from space: Combining genomic information, microscopic and satellite imagery.

Topics

Topics of interest of this workshop can be grouped into two sets:

  1. Addressing and advancing the state of the art in areas like AI, ML, mathematical modeling and simulation. Here the focus is set on:
    • improving neural network handling of graph-structured information,
    • improving the capacity of ML methods to learn in small data contexts,
    • understanding causal relations, interpretability and explainability in AI,
    • integrating model-driven and data-driven approaches, and
    • to develop, calibrate, and validate existing mechanistic models.
  2. Focus on answering the questions from the application domain, where the main questions to be addressed are:
    • Which are the major patterns in plankton taxa and functional diversity?
    • Which are the major drivers of patterns, and how do they interact?
    • How these patterns and drivers will likely change under climate change?
    • How will these changes affect the capacity of ocean ecosystems to sequester carbon from the atmosphere, that is the biological carbon pump?
    • What relations bind communities and local conditions?
    • What are the links between biodiversity functioning and structure?
    • How modern AI and computer vision can be applied as research and discovery support tools to understand planktonic communities?
    • How new biological knowledge can be derived from the application of anomaly detection, causal learning, and explainable AI.

Detailed programme (in CET timezone)

  • When: 23 July 2022, 14:00-17:00 CET (08:00-11:00 CLT/EST, 09:00-12:00 BRT)
    • BRT timezone -5 hours; EST/CLT timezone -6 hours.
  • Attending in person: Room Schubert 1. Messe Wien, Vienna.
  • Attending online: Use the URI https://meet.jit.si/aimocc-2022.
  • 14:00 - 14:15. Opening comments and welcome by the organizers.

  • 14:15 - 14:40. A Physics-Informed Neural Network to Model Port Channels. Marlon S. Mathias1, Caio Fabricio Deberaldini Netto1, Marcel M.B. Barros1, Jefferson F. Coelho2, Lucas P. de Freitas1, Felipe M. Moreno1, Fabio Cozman1, Anna Helena Reali Costa 1, Eduardo Aoun Tannuri1, Edson S. Gomi 1, and Marcelo Dottori3. (1) University of São Paulo, (2) São Paulo University (POLI-USP), (3) Oceanographic Institute, University of São Paulo. abstract paper (pdf) online presentation

  • 14:40 - 15:05. Towards Optimally Weighted Physics-Informed Neural Networks in Ocean Modelling. Hugo Carrillo Lincopi, Taco de Wolff, Luis Martí, and Nayat Sánchez Pi. Inria Chile Research Center. abstract paper (pdf) online presentation

  • 15:05 - 15:30. Modeling Oceanic Variables with Dynamic Graph Neural Networks. Caio Fabricio Deberaldini Netto1, Marcel M.B. Barros1, Jefferson F. Coelho2, Felipe M. Moreno1, Marlon S. Mathias1, Lucas P. de Freitas1, Fabio Cozman1, Marcelo Dottori3, Eduardo Aoun Tannuri1, Edson S. Gomi1, and Anna Helena Reali Costa1. (1) University of São Paulo, (2) São Paulo University (POLI-USP), (3) Oceanographic Institute, University of São Paulo. abstract paper (pdf) online presentation

  • 15:30 - 16:00. Coffee break (we stay in the online call and chat).

  • 16:00 - 16:25. Enhancing Oceanic Variables Forecast in the Santos Channel by Estimating Model Error with Random Forests. Felipe M. Moreno1, Caio Fabricio Deberaldini Netto1, Marcel M.B. Barros1, Jefferson F. Coelho2, Lucas P de Freitas1, Marlon S. Mathias1, Luiz Schiaveto Neto3, Marcelo Dottori4, Fabio Cozman1, Anna Helena Reali Costa1, Edson S. Gomi1, and Eduardo Aoun Tannuri 1. (1) University of São Paulo, (2) São Paulo University (POLI-USP), (3) Escola Politécnica – University of Sao Paulo, (4) Oceanographic Institute, University of São Paulo. abstract paper (pdf) online presentation

  • 16:25 - 16:50. The BLue Amazon Brain (BLAB): A Modular Architecture of Services about the Brazilian Maritime Territory. Paulo Pirozelli1, Ais B.R. Castro1, Ana Luiza C. de Oliveira1, André Seidel1, Flávio N. Cação1, Igor C. Silveira1, João G M Campos1, Laura C. Motheo1, Leticia F. Figueiredo1, Lucas F.A.O. Pellicer1, Marcelo A. José1, Marcos M. José1, Pedro de M. Ligabue1, Ricardo S. Grava1, Rodrigo M. Tavares1, Vinícius B. Matos1, Yan V. Sym1, Anna Helena Reali Costa1, Anarosa Alves Franco Brandão2, Denis D. Maua1 Fabio Cozman1, Sarajane M. Peres1. (1) University of São Paulo, (2) Escola Politécnica – University of Sao Paulo. abstract paper (pdf) online presentation

  • 16:50 - 17:00. Final remarks.

  • 17:00 - until available. Open topic conversations.

Submissions

We welcome submissions of full papers (8 pages, not counting references) and short summary papers (4 pages, not counting references). Papers must be written in English and in PDF format according to the IJCAI-ECAI’22 style. All submitted papers will be under a single-blinded peer review for their novelty, technical quality and impact. The submissions can contain author details.

Important dates

  • Submission deadline extended! : June 4, 2022 (UTC-12) May 20, 2022 (UTC-12).
  • Notification of acceptance: June 11, 2022.
  • Reception of final version: June 18, 2022.

Post-proceedings publication

We will seek to publish selected, revised, extended papers later in a planned post-proceedings volume, to be published in the Lecture Notes in Artificial Intelligence (LNAI) series. The selection of papers will be managed by a subset of the workshop organizing committee.

Organizers

  • Nayat Sánchez-Pi, Inria Chile Research Center.
  • Pablo Marquet, Pontificia Universidad Católica de Chile.
  • Alejandro Maass, Center of Mathematical Modeling (CMM), Universidad de Chile.
  • Luis Martí, Inria Chile Research Center.

Scientific committee

  • José Manuel Molina, Universidad Carlos III de Madrid,
  • Julien Salomon, ANGE, Inria Paris,
  • Jacques Sainte-Marie, ANGE, Inria Paris,
  • Olivier Bernard, BIOCORE, Inria Sophia-Antipolis,
  • Michèle Sebag, TAU, Inria Saclay,
  • Marc Schoenauer, TAU, Inria Saclay,
  • Pablo Marquet, Pontificia Universidad Católica de Chile (PUC),
  • André Abreu, Fondation TARA Océan,
  • Ana Cristina Garcia Bicharra, Unirio - Federal University of Rio de Janeiro State,
  • Hernán Lira, Inria Chile Research Center,
  • Hugo Carrillo Lincopi, Inria Chile Research Center,
  • Andrew Berry, Inria Chile Research Center,
  • Luis Valenzuela, Inria Chile Research Center,
  • Leandro Fernandes, Universidade Federal Fluminense,
  • Roberto Santana, University of the Basque Country (UPV/EHU),
  • Colomban De Vargas, GO-SEE CNRS Federation, and
  • Damien Eveillard, ComBi, Université de Nantes.

Diversity commitment

We will seek diversity in all aspects, both in school of thought, nationalities, stages in the academic career, etc.

Access

We will publish the accepted papers and talk abstracts (before the event) and the slides of the speakers (after the event) on the workshop website. We will include a bibliography of most relevant research papers to facilitate cross-pollination of ideas between these fields. Similarly, we will record the workshop and publish it online.

OcéanIA IJCAI 2022 Challenge

AI methods for determining ocean ecosystems from space: Combining genomic information, microscopic and satellite imagery

An IJCAI-ECAI 2022 Challenge

It is our distinct pleasure to invite you to the Challenge AI methods for determining ocean ecosystems from space: Combining genomic information, microscopic and satellite imagery to be held in conjunction with the 31st International Joint Conference on Artificial Intelligence and the 25th European Conference on Artificial Intelligence (IJCAI-ECAI-2022) on July 23-29, 2022, in Messe Wien, Vienna, Austria.

Context

The ocean is the Earth’s principal climate regulator and the main responsible for sequestering carbon dioxide (CO2). This makes it our main defense against climate change, but climate change itself is destroying the healing capacity of the ocean. Algae and, in particular, plankton, play a fundamental role in this, as they are able to remove CO2. Therefore, the mitigating capacity of an ecosystem can be established based on the presence of particular types of plankton. However, to health of the larger areas of the ocean can only be determined through large-scale measurements such as satellite imagery.

The challenge focuses on the remote identification via satellite imagery of high-potential ecosystems. This would allow large tracts of the ocean to be analyzed in a way that allows scientists and decision makers to understand how the ocean evolves over time and could be used to create policies for protecting high-value parts of the ocean. Alternatively, we propose to study the use of marker species, such as whales, which can be identified and their presence implies the existence of others.

This is an opportunity to attract the AI/ML community to this type of scientifically challenging and high-impact problem. For this we will make available to participants curated georeferenced datasets of plankton images, genomic data and satellite images and provide mentorship during the period of the challenge. It falls under the activities of Inria Project OcéanIA.

Goals

We propose to determine the variation of plankton species —i.e. ecosystems— inhabiting a given area of the ocean by cross-referencing genomic data, plankton microscope imaging and satellite images. This calls for the combined application of methods like:

  • causal inference,
  • explainable AI,
  • computer vision neural networks: representation learning, self-supervision, out of distribution detection,
  • ML methods for “small data contexts” like zero-shot/few-shot learning, and active learning, among others,
  • associative rule learning, and
  • domain adaptation and transfer learning, to mention a few.

Participation guide

The challenge will take place from 20 April 20 2022 to 29 July 2022. Teams can join the challenge at any time, but we suggest you that you do it as early as possible.

The challenge is organized in two phases:

  • Phase I: where participants work on a solution proposal and plan.
    • At the end of this phase, participants must submit a short paper (max. 2 pages excl. references) and (optionally) supplementary code.
    • Submitted proposals are evaluated, and selected ones are invited to take part of the phase II.
  • Phase II: where participants work on their challenge solutions.
    • At the end of this phase, participants should submit a full paper (max. 6 pages excl. references) and make available the supplementary code under an OSI approved license (i.e. MIT, Apache, etc.).

Datasets available to challenge participants

Getting involved

  • Join the mailing list: If you are interested to take part of the challenge, please let us know by filling up this form.
  • Join out discord server to get support, collaborate and exchange with other participants.
  • Follow Inria Chile on Twitter to for more news and updates.

Paper preparation instructions

Papers must be written in English and in PDF format according to the IJCAI-ECAI’22 style. All submitted papers will be under a single-blinded peer review for their novelty, technical quality and impact. The submissions can contain author details. See below for submission link.

Source code instructions

The challenge will help bring recent state-of-the-art AI/ML methods to tackle complex and high-impact problems that have a potential for global impact. Experts on this field have limited access and operational knowledge on how to use these advanced methods. Consequently, we will pay extra attention and involve participants in order to make their code contributions available in a form as usable as possible by non-AI/ML experts.

  • During the unfolding of the challenge source code availability (open source or private) will be left to the decision of the participants.
  • Derived and/or intermediate datasets that we consider of value will also be made freely available.
  • Upon acceptance, participants code should be made available online under an open-source friendly license, in particular it should be an OSI approved license.
  • We encourage participants to make their source code as easy to use as possible by providing installation scripts, instructions, etc.

Timeline of participation

  • Start of challenge (20 April 2022).
  • Phase I. Solution proposal preparation (20 April – 7 June 2022).
    • During this phase participants work on the conception of their solutions.
    • Participants are encouraged to interact via email or discord with organizers and other participants.
  • Submission of proposals (7 June 2022, UTC-12). Proposal submissions must include:
    • Short paper (max 2 pages excluding references) with the proposed solution, potential impact, planning, etc., and
    • Code repo (optional) link to code repository (i.e. GitHub, GitLab, etc.) with supplementary code. This code does not need to be public, but in that case organizers should have access granted to it.
    • CMT submission site: https://cmt3.research.microsoft.com/IJCAIOceanIAIChallenge2022
  • Notification of proposal acceptance (14 June 2022).
    • Challenge organizers will communicate which proposals will are accepted into Phase II.
  • Phase II. Construction of final solution (15 June – 15 July 2022).
    • During this phase accepted participants will work towards the solution to be presented in the challenge session at IJCAI-ECAI’22.
  • Submission of final solutions (16 July 2022, UTC-12): Final solution submissions must include:
  • Challenge session and awards at IJCAI-ECAI’22 (July 22-29, 2022): Participants will present their solutions in an in-person session in the conference. Only participants registered at ICJAI-ECAI’22 will be able to take part of the session.

Awards

We will provide small ocean-related gifts and cloud compute to the best contributions. Stay tuned for more details.

Publications and Post-proceedings

Dissemination is very important for the goals of the challenge. We will publish a non-archival proceedings booklet with the contributions and the main experiences gained during the challenge. Therefore, both the peer-review post volume and the challenge paper describing the results, experiences and lessons learned are interesting for us.

Organizers

Scientific committee

  • José Manuel Molina, Universidad Carlos III de Madrid,
  • Pablo Marquet, Pontificia Universidad Católica de Chile.
  • Julien Salomon, ANGE, Inria Paris,
  • Jacques Sainte-Marie, ANGE, Inria Paris,
  • Olivier Bernard, BIOCORE, Inria Sophia-Antipolis,
  • Michèle Sebag, TAU, Inria Saclay,
  • Marc Schoenauer, TAU, Inria Saclay,
  • Alejandro Maass, Center of Mathematical Modeling (CMM), Universidad de Chile,
  • Pablo Marquet, Pontificia Universidad Católica de Chile (PUC),
  • André Abreu, Fondation TARA Océan,
  • Ana Cristina Garcia Bicharra, Unirio - Federal University of Rio de Janeiro State,
  • Hernán Lira, Inria Chile Research Center,
  • Hugo Carrillo Lincopi, Inria Chile Research Center,
  • Leandro Fernandes, Universidade Federal Fluminense,
  • Roberto Santana, University of the Basque Country (UPV/EHU),
  • Colomban De Vargas, GO-SEE CNRS Federation, and
  • Damien Eveillard, ComBi, Université de Nantes.

Sponsorship

We are actively seeking support for different organizations. If you are interested to sponsor this challenge do not hesitate to contact us.

Diversity commitment

We will actively seek diversity in all aspects: schools of thought, theoretical backgrounds, nationalities, stages in the academic career, gender, etc. We will take an affirmative action to ensure that by disseminating the call for papers in diverse communities and offer a mentorship and assistance to help underrepresented and cross-disciplinary participants.

AIMOCC at ECAI 2024

fish-school.jpg

AI: Modeling Oceans and Climate Change 2024

An ECAI 2024 Workshop

It is our distinct pleasure to invite you to the Workshop AI: Modeling Oceans and Climate Change (AIMOCC 2024) to be held in conjunction with the 27th European Conference on Artificial Intelligence (ECAI 2024) on October 19-24, 2024 in Santiago de Compostela, Spain.

The Anthropocene has brought along a drastic impact on almost all life forms on the planet. Considering the importance and amount of water in this speck of dust in the middle of nowhere that we inhabit, we should have called it Planet Ocean. Oceans are not only important because of their volume but are also about the functions and contributions they provide to biodiversity, the human species included.

The goal of this workshop is to bring together researchers that are interested and/or applying AI and ML techniques to problems related to marine biology, modeling, and climate change mitigation. We also expect to attract natural science researchers interested in learning about and applying modern AI and ML methods. Consequently, the workshop will be a first stone on building a multi-disciplinary community behind this research topic, with collaborating researchers that share problems, insights, code, data, benchmarks, training pipelines, etc. Together, we aim to ultimately address an urgent matter regarding the future of humankind, nature, and our planet.

Topics

Topics of interest of this workshop can be grouped into two sets:

  1. Addressing and advancing the state of the art in areas like AI, ML, mathematical modeling and simulation. Here the focus is set on:
    • improving neural network handling of graph-structured information,
    • improving the capacity of ML methods to learn in small data contexts,
    • understanding causal relations, interpretability and explainability in AI,
    • integrating model-driven and data-driven approaches, and
    • to develop, calibrate, and validate existing mechanistic models.
  2. Focus on answering the questions from the application domain, where the main questions to be addressed are:
    • Which are the major patterns in plankton taxa and functional diversity?
    • Which are the major drivers of patterns, and how do they interact?
    • How these patterns and drivers will likely change under climate change?
    • How will these changes affect the capacity of ocean ecosystems to sequester carbon from the atmosphere, that is the biological carbon pump?
    • What relations bind communities and local conditions?
    • What are the links between biodiversity functioning and structure?
    • How modern AI and computer vision can be applied as research and discovery support tools to understand planktonic communities?
    • How new biological knowledge can be derived from the application of anomaly detection, causal learning, and explainable AI.

Submissions

We welcome submissions of full papers (8 pages, not counting references) and short summary papers (4 pages, not counting references). Papers must be written in English and in PDF format according to the ECAI’24 LaTeX template. All submitted papers will be under a single-blinded peer review for their novelty, technical quality and impact. The submissions can contain author details.

Important dates

  • Submission deadline: May 15, 2024 (UTC-12).
  • Notification of acceptance: July 15, 2024.
  • Reception of final version: August 10, 2024.

Post-proceedings publication

We will seek to publish selected, revised, extended papers later in a planned post-proceedings volume, to be published in the Lecture Notes in Artificial Intelligence (LNAI) series. The selection of papers will be managed by a subset of the workshop organizing committee.

Organizers

Scientific committee

  • José Manuel Molina, Universidad Carlos III de Madrid,
  • Julien Salomon, ANGE, Inria Paris,
  • Jacques Sainte-Marie, ANGE, Inria Paris,
  • Olivier Bernard, BIOCORE, Inria Sophia-Antipolis,
  • Michèle Sebag, TAU, Inria Saclay,
  • Marc Schoenauer, TAU, Inria Saclay,
  • Pablo Marquet, Pontificia Universidad Católica de Chile (PUC),
  • André Abreu, Fondation TARA Océan,
  • Ana Cristina Garcia Bicharra, Unirio - Federal University of Rio de Janeiro State,
  • Hernán Lira, Inria Chile Research Center,
  • Luis Valenzuela, Inria Chile Research Center,
  • Leandro Fernandes, Universidade Federal Fluminense,
  • Roberto Santana, University of the Basque Country (UPV/EHU),
  • Colomban De Vargas, GO-SEE CNRS Federation, and
  • Damien Eveillard, ComBi, Université de Nantes.

Diversity commitment

We will seek diversity in all aspects, both in school of thought, nationalities, stages in the academic career, etc.

Access

We will publish the accepted papers and talk abstracts (before the event) and the slides of the speakers (after the event) on the workshop website. We will include a bibliography of most relevant research papers to facilitate cross-pollination of ideas between these fields. Similarly, we will record the workshop and publish it online.

Software

Extract biologic subsequences
of interest from large FASTA files

Serverless cloud service. Focus on your query, not on managing storage or compute infraestructure.

Preloaded data catalog. No need to move large files around.

Access the service right from your Python code. Get query results as a Pandas DataFrame.
Run a sample query on a
Jupyter notebook now!

How does it work?

OcéanIA Platform lets you query large FASTA files in our supported data catalogs for extracting parts of biologic sequences. Just import our Python library in your code and access the query service.

ACCESS SUPPORTED DATA CATALOGS

We currently support the Ocean Microbial Reference Gene Catalog v2 (OM-RGC.v2) gene catalog from Tara Oceans Expedition. If you would like to try the service on files from other catalogs please contact us.

FOCUS ON THE QUERIES

Run your queries locally on your workstation either through a Python script or a Jupyter notebook. Install our Python package, then import our Python module in your code, and you are ready to go.

QUERY FASTA FILES

Extract multiple specific gene sequences from a file in a single query.

 

How to use

  • Run pip install oceania-query-fasta to install the Python package that enables access to query service.
  • From your Python code (either a script or a notebook) import the oceania module.
  • Select a file from the catalog, define your query, apply the query to the file. The result can be either retrieved as a Pandas DataFrame or saved as a CSV file.
A Jupyter notebook with a minimal example of a query that extracts a set of sequences from a given file in the catalog.


A Jupyter notebook with an example that extracts large intergenic regions (IGRs) from one surface sample.

news

Diatoms_through_the_microscope.jpg

Others

El PeriodistaTV

El late de los datos y la IA

10 Mar 2021

join us

Diatoms_through_the_microscope.jpg

Open positions in Chile

Additional information and other positions are listed in the Inria Chile website.

Open positions in France

2024 OcéanIA Annual Meeting

csiro_diatoms.jpg

The 2024 OcéanIA annual meeting took place on February 23, 2024, in the offices of Inria Paris.

The primary outcomes of the project so far can be summarized as:

  • Development of state-of-the-art computer vision techniques for explainable plankton identification, paving the way for a more cost-effective approach to describing oceanic ecosystems.
  • Identification of patterns linking genomic characteristics of ecosystems to their functions, enabling the recognition of valuable ecosystems on a global scale.
  • Advancements in modeling, physics-informed machine learning, and the coupling of complex systems models. These progressions aim to empower us to simulate climate change scenarios, predicting their impact on, and potential displacement of, plankton ecosystems.

The project’s diverse methodologies and interdisciplinary approaches have not only yielded significant findings but, perhaps more crucially, have laid the groundwork for an ambitious next phase.

Program

  • 09:00 - 09:30 Welcome coffee.
  • 09:30 - 09:40 Intro and words by Jean-Frédéric Gerbeau.
  • 09:40 - 10:00 Nayat Sanchez and Luis Martí (Inria Chile): Overall status of OcéanIA, organization of the day and presentations. slides
  • 10:00 - 10:30 Olivier Bernard and Francesca Casagli (BIOCORE): Effect of temperature and light on phytoplankton growth. slides
  • 10:30 - 11:00 Romain Ranini (BIOCORE): Spatio-temporal high-resolution models of particulate organic matter abundance in the ocean. slides
  • 11:00 - 11:30 Alejandro Maass (CMM, Univ. Chile): The CEODOS mission.
  • 11:30 - 12:00 M. Sebag, M. Schoenauer, and L. Martí (TAU + Inria Chile): Understanding ecosystems via plankton images. slides
  • 12:00 - 11:30 Pierre Peterlongo (DYLISS): Invited presentation about Inria Challenge OmicsFinder.
  • 12:30 - 13:00 Louis Thiry (ANGE): Simulation of several large-scale ocean models. slides
  • 13:00 - 15:00 Lunch at Le Repaire (100 Av. Daumesnil 75012)
  • 15:00 - 15:30 Luis Valenzuela (Inria Chile): Modeling global plankton communities via multiomics and ML approaches. slides
  • 15:30 - 16:00 Pablo Marquet (PUC): A general theory for temperature dependence in biology. slides
  • 16:00 - 16:30 Fabien Lombard and Jean-Olivier Irisson (Laboratoire d’Océanographie de Villefranche): Invited presentation about the Ecotaxa project.
  • 16:30 - 16:45 Pause coffee.
  • 16:45 - 17:10 Luis Martí (Inria Chile): Progress in Physics-informed ML and its applications in oceanic processes. slides
  • 17:10 - 17:30 Speed presentations by Inria Chile interns of 2024.
  • 17:30 - 18:00 Discussion of next steps and conclusions.

Project status notes

The project goal can be summarized as the pursuit of the answers to three increasingly complex interdependent questions: (i) how the Ocean mitigates climate change, (ii) how climate change will impact that capacity, and (iii) what can be done to protect the Ocean.

The overall organization of work in the project can be framed in a map/reduce schema. That is that in the first phase of the project (map) activities were focused on the specific research questions while in the second (reduce) phase would then focus on the convergence of the different results as results that can be shared and actionable.

Most of the activities reported in here fall under the map phase.

Understanding ecosystems via plankton images

We have made the first study on the application of state-of-the-art computer vision models for plankton identification. For this we have used massive raw, unprocessed large plankton datasets. We have also applied novel hierarchical classification, out-of-distribution detection methods, diversity-preserving loss functions, multi-modal/multi-task learning, strategies for handling class imbalance, and explainable AI methods to render the results more valuable.

We have also experimented with data augmentation and contrastive learning approaches to address the problem of identifying outliers (unknown species) using generalized contrastive losses and introducing fake outliers extracted from known images, or creating them as chimeras or applying generative AI methods.

Characterization of the Ocean carbon pump

The Ocean functions as a reservoir for CO2 through the biological carbon pump. This process is fueled by the metabolic diversity in plankton communities, which is encoded in their genetic material. Such diversity is a cornerstone in fueling global biogeochemical cycles, underpinning food webs, and aiding in climate regulation.

Two main efforts have been made around this: (i) identification of relations between (meta) omic features with ecosystem functions and (ii) global scale spatio-temporal characterization of the carbon pump using the organic matter abundance.

In the first effort we focused on deciphering global patterns within the matrices derived from marine microbial datasets. To achieve this, we utilized unsupervised machine learning methods, such as uniform manifold approximation and projection (UMAP) and autoencoder neural networks.

Notably, samples from the mesopelagic zone clustered distinctly apart from other samples, and similarly, polar samples were clearly segregated from non-polar ones. We estimated Shapley additive explanations (SHAP) to understand clusters (polar vs. non-polar, mesopelagic vs. surface samples) to understand the factors driving them. We also applied novel principles like (i) topological data analysis, which assesses the topological features of data across various scales, (ii) symbolic regression, capable of identifying concise interpretable human-readable expressions directly from data, (iii) large language models for direct genomic tagging, and (iv) optimal transport to project metagenomic/metatranscriptomic characteristics across different depths.

We also focused on characterizing the different oceanic layers: surface, Deep Chlorophyll Maximum (DCM), and mesopelagic. For this purpose, we trained classification models and utilized SHAP (SHapley Additive exPlanations) values to dissect the variations in the metagenomic, metatranscriptomic, and taxonomic compositions across these layers. Our findings revealed that the primary distinctions between the top two layers (surface and DCM) are linked to aspects of carbon and amino acid metabolism, pathways of carbon fixation and, antimicrobial processes. In contrast, when analyzing the deeper mesopelagic layer, these processes appeared to be significantly different, being noteworthy processes involved in protein folding adaptations to high pressure and low temperature.

Symbolic regression, a cutting-edge machine learning technique that excels in identifying concise, interpretable human-readable expressions directly from data. This method was applied to explore the intricate relationships between environmental and biological variables. Specifically, we trained symbolic regression models to predict the environmental parameters, using features derived from metagenomic, metatranscriptomic, and taxonomic datasets as predictors. The results of this analysis were particularly striking for certain environmental variables. Our models achieved the highest R-squared (R2) coefficients in predicting oxygen levels and temperature. This was followed by accurate predictions for salinity, latitude, water density, and concentrations of silicate, nitrate, and phosphate. These findings underscore the potential of symbolic regression as a powerful tool in environmental genomics, offering valuable insights into the complex interplay between biological and environmental factors in oceanic ecosystems.

With regard to the second effort, we applied machine learning to generate the first spatio-temporal high-resolution model of organic matter abundance in the Ocean with the aim of describing the biological carbon pump at a global scale. A database has been consolidated, gathering satellite data and 8803 vertical profiles particulate organic matter gathered from 2008 to 2020 in different areas of the Ocean. We applied gradient boosting algorithms (XGBoost) combined with UMAP in order to create a global visualization. Special care has been taken to deal with the preparation of the ML training and the test data sets and the hyperparameter sweeps.We ran several experiments varying the input data, the spatio-temporal resolution of the working grid and the model architecture. A serial particle distribution with 10 particle size classes is predicted on a daily scale and a spatial resolution of 0.25 angular degrees, generating a surface particle field from the model, supporting an innovative description of the distribution of particulate organic matter in the Ocean.

In this context, P. Marquet and colleages has published a groundbreaking paper associating temperature and biological functions that is assimilated by different members of the team for in-depth study.

Models and physics-informed methods

There has been a substantial amount of work in the context of physics-informed ML.

A model based on neural ODEs was developed to monitor a phytoplankton biomass evolution inside an experimental photobioreactor. Two models were studied, the first one replaces the growth rate of the microalgae by a neural network, the second one uses a neural network to correct the bias of a kinetic model based on the literature. Mini batch training and learning rate schedulers were tested. We implement our own solver for ordinary differential equation problems, which interpolates the data and solves a set of equations in every step to allow mini-batch training. Both models efficiently reproduced the effect of a light gradient on a population of cell. The work will be submitted to Engineering Applications of Artificial Intelligence. Combining a host-virus model with different SST temperature predictions to represent long-term dynamics in environments revealed the potential for long-term host-virus coexistence, epidemic free or habitat loss states.

We generalized our model to variation in global sea surface temperatures corresponding to current and future seas and show that climate change may differentially influence virus-host dynamics depending on the virus-host pair. Temperature-dependent changes in the infectivity of virus particles may lead to shifts in virus-host habitats in warmer oceans, analogous to projected changes in the habitats of macro-, microorganisms and pathogens (work between Biocore and the LOV).

We addressed the coupling of hyperbolic PDEs with a neural network for the reconstruction of dust transport during the Holocene period. To our knowledge, this is the first time it has been applied in this field. This work has given rise to an article currently being submitted.

We approached the problem of PDE/NN coupling in the thesis of L. Migus with a focus on the identification of a friction term in a Saint-Venant-type system. The idea here is to make a network play the role of the term in a numerical solver, i.e., the network is called up at each time step, taking the flow height and velocity as input variables.

The difficulty lies in learning through a solver, since friction is not directly available; the only observable quantities are height and velocity. We study the resulting degradation and show that the resulting solver is however stable and highly efficient. This work is currently being written. Léon Migus’ thesis was defended in December 2023.

We worked on the coupling of Navier-Stokes and NPZ(D) models to create dynamic models of plankton populations in the ocean.

Finally, we proposed the Multi-Objective Physics-Informed Neural Networks (MOPINNs). MOPINNs applies evolutionary multi-objective AutoML to find the set of trade-offs between the data and physical losses of PINNs by at the same time optimizing the model complexity. It allows practitioners to correctly identify which of these trade-offs better represent the solution they want to reach. This work has received the ``accessit to best paper award’’ at conference IBERAMIA’2022.

page not found

How it works

Oceania-query-fasta is a Python package that is built to make queries in Ocean Microbial Reference Gene Catalog v2 ~100GB (gziped) of FASTA, CSV and TSV files.

Install library oceania-query-fasta

Open complex Demo in Google Colab

See complex Demo in Jupyter NbViewer

View Github Demo code

Download simple Jupyter Notebook

Download complex Jupyter Notebook

Feedback report
pip install oceania-query-fasta
STORAGE_KEY = "data/raw/tara/OM-RGC_v2/assemblies/TARA_A100000171.scaftig.gz"
POSITIONS = [
    ["TARA_A100000171_G_scaffold48_1", 10, 50, "complement"],
    ["TARA_A100000171_G_scaffold48_1", 10, 50],
    ["TARA_A100000171_G_scaffold48_1", 10, 50, "reverse_complement"],
    ["TARA_A100000171_G_scaffold181_1", 0, 50],
    ["TARA_A100000171_G_scaffold181_1", 100, 200],
    ["TARA_A100000171_G_scaffold181_1", 200, 230],
    ["TARA_A100000171_G_scaffold493_2", 54, 76],
    ["TARA_A100000171_G_scaffold50396_2", 87, 105],
    ["TARA_A100000171_G_C2001995_1", 20, 635],
    ["TARA_A100000171_G_C2026460_1", 0, 100],
  ]
results = get_sequences_from_fasta(
    STORAGE_KEY,
    POSITIONS
)
print(results)

Features:

  • Usage from Jupyter, command-line and Python package
  • Queries of Ocean Microbial Reference Gene Catalog (OM-RGC_v2)
  • Support custom queries
  • Output format CSV or FASTA
  • Free Cloud Computing (Serverless)