Heterogeneous datasets: a tale of integration and exploration
This talk presents the different researches that I have conducted since my Bachelor in the heterogeneous data integration area. See my research.
SlidesThis talk presents the different researches that I have conducted since my Bachelor in the heterogeneous data integration area. See my research.
SlidesThis talk presents my PhD work on how to approach semi-structured data as a novice or as a data scientist who has to work with new data. The two main axes are: (a) E-R diagrams built out of any structured or semi-structured dataset ; (b) entity-to-entity path enumeration and ranking.
SlidesThis workshop has been conducted for and with first-year (CPES) undergraduate students. The goal is to provide background in data integration, relational databases, graphs, but also NLP and LLM. The workshop also engaged students to manipulate existing research tools for data integration, such as ConnectionLens and StatCheck.
SlidesThe goal of this forum at CFI (French media development agency) is to present new research directions and tools that journalists could later use in their quest of better information sharing and acquisition. ConnectionStudio has been presented in detail; high interest has been shown during a long question-answering time.
Slides (in French)This talk is aimed at high school female scientists and at the RMJI (young female mathematicians and computer scientists meeting). It covers both data integration challenges and main aspects of being a PhD student and doing research. The former part presents ConnectionLens, a graph-based approach to integrate very heterogeneous data (structured, semi-structured and un-structured) in the context of data journalism. The latter part emphasises the different aspects of doing research.
Slides (in French)