about me

Lidia Contreras Ochando

PhD Student in Computer Science

PhD Student in Computer Science in the DMIP team (a subgroup of ELP) of the Department of Computer Systems and Computation of the Universitat Politènica de València, under the supervision of Cèsar Ferri and José Hernández-Orallo.


MECD FPU Grant (2016-2020): REF FPU15/03219

    My thesis research is focused on the automation of the Data Science process, especially in the data wrangling step.


  • President and co-founder of dataUPV group
  • Founder and co-organizer of R-Ladies Valencia
  • Member of NASA Datanauts 2017 Fall Class

experience

Visiting Scholar - Katholieke Universiteit Leuven

Mar 2018 - Jun 2018

Guest member of the Machine Learning Research Group at Department of Computer Science, under the supervision of Luc De Raedt.

Researcher in Computer Science - Universitat Politècnica de València

Oct 2015 - Current

Member of the DMIP group at Department of Computer Systems and Computation.

Member of the LOBASS Project.

Researcher in Computer Science - Université de Strasbourg

Apr 2016 - Oct 2016

Member of the SDC group at iCube Department (Le laboratoire des sciences de l'ingénieur, de l'informatique et de l'imagerie), under the supervision of Nicolas Lachiche.

Member of the REFRAME project.

Webmaster (collaboration grant) at Universitat Politècnica de València

Oct 2013 - Oct 2015

Webmaster at School of Informatics.

education

Doctorado en Informática (PhD in Computer Science) - Universitat Politècnica de València

Current

MECD FPU Grant (2016-2020)

Master Universitario en Gestión de la Información (Master's Degree in Information Management) - Universitat Politècnica de València

Sep 2016

Best Master's records award

Best Masther Thesis award

Grado en Ingeniería Informática (Bachelor's Degree in Informatics Engineering) - Universitat Politècnica de València

Sept 2014

Ingeniería Técnica en Informática de Gestión (Technical Engineer in Administrative Data Processing) - Universitat Politècnica de València

Sept 2011

Awards

HackFoorGood Valencia 2018: Project ParticleAI

  • 2nd local prize

Mar 2018

Poster finalist on IV phD Students Meeting UPV

May 2017

Sandalio Miguel - Maria Aparicio award for airVLC project

Nov 2016

Best Master Thesis award - Cátedra de Transparencia y participació

Oct 2016

Best Academic record award (Master)- Universitat Politècnica de València

Oct 2016

HackFoorGood Valencia 2016: Project BikeXplorer

  • 1st local prize
  • 3rd national prize
  • Telefónica's Open Future prize
  • Open Data valencia prize
  • Open Webinars prize

Feb 2016

HackFoorGood Valencia 2015: Project airVLC

  • 3rd local prize
  • Telefónica's Think Big prize
  • Best challenge prize

Apr 2015

Finalist on iConference's Social Media Expo with Transparency Science project

Mar 2015

Publications

How Data Science helps to build Smart Cities: Valencia as a use case

Lidia Contreras-Ochando, Cristina I. Font-Julián, David Nieves, Fernando Martínez-Plumed

Alcoi: Small & Medium Smart Cities 2018, Alcoi (Spain), [PDF]

Domain specific induction for data wrangling automation

Lidia Contreras-Ochando

Joint work of Lidia Contreras-Ochando, Cèsar Ferri, José Hernández-Orallo, Fernando Martínez-Plumed, María José Ramírez-Quintana and Susumu Katayama

Report from Dagstuhl Seminar 17382 Approaches and Applications of Inductive Programming, Dagstuhl (Germany) 2017, [PDF]

Domain specific induction for data wrangling automation (Demo)

Lidia Contreras-Ochando, Cèsar Ferri, José Hernández-Orallo, Fernando Martínez-Plumed, María José Ramírez-Quintana and Susumu Katayama

Automatic Machine Learning Workshop (AutomML) @ ICML 2017, Sídney (Australia), [PDF]

airVLC: An application for visualizing wind-sensitive interpolation of urban air pollution forecasts

Lidia Contreras-Ochando and Cèsar Ferri

IEEE International Conference on Data Mining series (ICDM 2016), Barcelona (Spain), [PDF]

General-Purpose Inductive Programming for Data Wrangling Automation

Lidia Contreras-Ochando, Fernando Martínez-Plumed, Cèsar Ferri, José Hernández-Orallo and M. José Ramírez-Quintana

Artificial Intelligence for Data Science (AI4DataSci) @ NIPS 2016, Barcelona (Spain), [PDF]

Logging Data Scientists: Collecting Evidence for Data Science Automation

Lidia Contreras-Ochando, Fernando Martínez-Plumed, Cèsar Ferri, José Hernández-Orallo and M. José Ramírez-Quintana

Artificial Intelligence for Data Science (AI4DataSci) @ NIPS 2016, Barcelona (Spain), [PDF]

CASP-DM: Bringing a Context-Aware Standard Process for Data Mining to Data Science

Hana Alouaoui, Lidia Contreras-Ochando, Cèsar Ferri, Peter Flach, José Hernández-Orallo, Meelis Kull, Nicolas Lachiche, Fernando Martínez-Plumed and M. José Ramírez-Quintana

European Data Science Conference (ECDS 2016), Luxembourg

Predicción e interpolación dinámica de los niveles de contaminación atmosférica mediante datos de intensidad de tráfico y dirección del viento

Lidia Contreras Ochando

Master thesis, [PDF]

Cycling network projects: a decision-making aid approach

Fernando Martínez-Plumed, Cèsar Ferri, and Lidia Contreras-Ochando

SoGood 2016 @ ECML 2016, Riva del Garda (Italy), pp 96-109, 2016, [PDF]

Wind-sensitive interpolation of urban air pollution forecasts

Lidia Contreras-Ochando and Cèsar Ferri

International Conference on Computational Science 2016, ICCS 2016, California (USA), [PDF]

Airvlc: An application for real time forecasting urban air pollution

Lidia Contreras-Ochando, Cristina I. Font-Julián, Francisco Contreras-Ochando and Cèsar Ferri

2nd Mining Urban Data Workshop (MUD2) @ ICML 2015, Lille (France), [PDF]

Transparency Science: Retorno de la inversión en investigación: Dónde van realmente los fondos?

Lidia Contreras, Cristina I. Font, Paulina Morillo and Diego Vallejo

XIV Jornadas Españolas de Documentación (FESABID 2015), Gijón (Spain)

TransparencyScience: Return on research investment, where do the funds go?

Lidia Contreras, Cristina I. Font, Paulina Morillo and Diego Vallejo

Social Media Expo @ iConference 2015, California (USA), [PDF]

Posters

Enhancing Inductive Programming by Function Ranking: A Machine Learning Application for Data Wrangling Automation

Spring workshop on Mining and Learning (SMiLe 2018), Black Forest (Germany) [PDF]

How Data Science helps to build Smart Cities: Valencia as a use case

Alcoi: Small & Medium Smart Cities 2018, Alcoi (Spain) [PDF]

Domain specific induction for data wrangling automation

Automatic Machine Learning Workshop (AutomML) @ ICML 2017, Sídney (Australia) [PDF]

Data Science by Demonstration

Summer School on Machine Learning, 2017 (Benicassim, Spain) [PDF]

Data Science by Demonstration

IV Encuentro de Estudiantes de Doctorado UPV (IV Students Meeting UPV), 2017 (Valencia, Spain) [PDF]

General-Purpose Inductive Programming for Data Wrangling Automation

Lidia Contreras-Ochando, Fernando Martínez-Plumed, Cèsar Ferri, José Hernández-Orallo and M. José Ramírez-Quintana

Artificial Intelligence for Data Science (AI4DataSci) @ NIPS 2016, Barcelona (Spain), [PDF]

Logging Data Scientists: Collecting Evidence for Data Science Automation

Lidia Contreras-Ochando, Fernando Martínez-Plumed, Cèsar Ferri, José Hernández-Orallo and M. José Ramírez-Quintana

Artificial Intelligence for Data Science (AI4DataSci) @ NIPS 2016, Barcelona (Spain), [PDF]

Wind-sensitive interpolation of urban air pollution forecasts

Lidia Contreras-Ochando and Cèsar Ferri

11th Women in Machine Learning Workshop (WiML2016) @ NIPS 2016, Barcelona (Spain), [PDF]

CASP-DM: Bringing a Context-Aware Standard Process for Data Mining to Data Science

Hana Alouaoui, Lidia Contreras-Ochando, Cèsar Ferri, Peter Flach, José Hernández-Orallo, Meelis Kull, Nicolas Lachiche, Fernando Martínez-Plumed and M. José Ramírez-Quintana

European Data Science Conference (ECDS 2016), Luxembourg

Talks

Enhancing Inductive Programming by Function Ranking: A Machine Learning Application for Data Wrangling Automation

Spring workshop on Mining and Learning (SMiLe 2018), Black Forest (Germany)

Domain specific induction for data wrangling automation

Dagstuhl Seminar 17382 Approaches and Applications of Inductive Programming, Dagstuhl (Germany) 2017

Domain specific induction for data wrangling automation

Automatic Machine Learning Workshop (AutomML) @ ICML 2017, Sídney (Australia)

dataUPV

El reto del talento universitario, el empoderamiento y la creación de comunidades: Generación Espontánea

Jornadas Técnicas de RedIRIS 2016, Valencia (Spain)

airVLC: An application for visualizing wind-sensitive interpolation of urban air pollution forecasts (DEMO)

IEEE International Conference on Data Mining series (ICDM 2016), Barcelona (Spain)

Logging Data Scientists: Collecting Evidence for Data Science Automation

Artificial Intelligence for Data Science (AI4DataSci) @ NIPS 2016, Barcelona (Spain)

Introducción a los datos abiertos

La ola que viene

OuiShare Fest Barcelona 2016, Barcelona (Spain)

Introducción a los datos abiertos

Valencia.rb, 2016, Valencia (Spain)

Cycling network projects: a decision-making aid approach

SoGood 2016 @ ECML 2016, Riva del Garda (Italy)

Predicción de contaminación con técnicas de Machine Learning

Segunda reunión de usuarios de R en Valencia

Usuarios R Valencia, 2016, Valencia (Spain)

Data Science for Smart Cities

Women Techmakers 2016, Castellón (Spain)

Teaching

Taller de introducción a la Ciencia de Datos (Data Science introduction workshop)

Jornadas de Ciencia de Datos - dataUPV (Valencia, Spain)

Data Cleaning & analysis process with RapidMiner
2018

Strategic Information Systems (Bachelor's Degree in Computer Science)

Data Science practices with RapidMiner
2017/2018

Taller de Ciencia de Datos. Tú haz las preguntas, los datos responden. (Data Science Workshop)

Forum Lluis Vives (Valencia, Spain)

Data Science process using RapidMiner
2017

Bachelor's & Master's thesis advisor

Enrique Guerrero Muñoz (Bachelor's Degree in Computer Science)

Exploratory Analysis of NASA Open Data

Supervisor: César Ferri
2017/2018

Jesús Sánchez Martínez (Master in Big Data Analytics)

bikeFlow: An Application for predicting bike sharing demand in the city of Valencia

Supervisor: Fernando Martínez Plumed
2017/2018

Projects

Domain Specific Induction for Data Wrangling Automation [website][PDF]

An approach to semi-automate some common transformations that appear in the data wrangling process, which is based on a general purpose inductive programming system extended with domain-specific background knowledge. The web-based tool allows users to provide a set of inputs and one or more examples of outputs, in such a way that a pattern is found that is applied to the rest of examples automatically.


Part of the PhD Thesis of Lidia Contreras-Ochando.

Lidia Contreras-Ochando, César Ferri, José Hernández-Orallo, Susumu Katayama, Fernando Martínez-Plumed, María José Ramírez-Quintana
2016-2018 (Current Project)

ParticleAI [website]

Shiny web application for studying several air particles (pollen, pollution and tiny particles). The application working with a AI central core, shows historical and future information in a visual way. The project includes an virtual assistent (Telegram bot) that helps and alert users on current and future alerts on high levels of particles in cities.


Project presented at the V Hackathon Telefónica HackForGood 2018:

  • 2nd local prize

Fernando Martínez-Plumed, Lidia Contreras-Ochando, Cristina I. Font-Julián, David Nieves Cordones
2018 (Current Project)

BikeFlow [website][PDF]

Improvement of BikeXplorer. We enriched the Valenbisi demand with meteorological data characterizing each station independently. The application allows the user to find historical and real-time information and the estimation of future demand in each station. We also use extrapolation techniques to estimate the demand that a currently non-existent station at any point in the city would have.


Project presented at the II OpenDatathon ETSINF-UPV 2017:

  • 1st prize: Valencia City council Award

Jesús Sánchez-Martínez, Fernando Martínez-Plumed, Lidia Contreras-Ochando
2017

BikeXplorer [website]

An application to predict the demand of Valenbisi stations. We use data about the historical demand of the different Valenbisi stations in the city. The application allows the user to obtain an estimation of the future demand in each Valenbisi station as well as the best route to reach a target station. In addition, the system is able to show the potential use of a new station in any point of the map, depending on the type of neighbourhood and census data.


Project presented at the IV Hackathon Telefónica HackForGood 2016:

  • 1st local prize
  • 3rd national prize
  • Telefónica's Open Future prize
  • Open Data valencia prize
  • Open Webinars prize

Lidia Contreras-Ochando, Cristina I. Font Julián, Fernando Martínez-Plumed
2016

TrafficXplorer [website]

A web application to show and predict traffic levels. The application has three different sections: historical data (showing traffic intensity from one to twelve hours before current hour); real-time data (showing real-time traffic data) and prediction (showing the traffic intensity prediction from one to twelve hours after the current hour).


Project presented at the I OpenDatathon ETSINF-UPV 2016:

  • 1st prize: BigML Award for Best predictive project

Lidia Contreras-Ochando, Fernando Martínez-Plumed
2016

MoreBikeLines [website]

An approach to design and calculate bike lane networks based on the use of open data about the historical use of a urban bike rental services. Concretely, we model this task as a network design problem (NDP) and we study four different optimisation strategies to solve it. The proposed method can be easily used to improve or extend bike lane networks based on historic bike use data in other cities.


Project presented at the I OpenDatathon ETSINF-UPV 2016:

  • Finalist: Special award Valencia City

Lidia Contreras-Ochando, Fernando Martínez-Plumed
2016

airVLC [website][PDF]

The data collected related with air pollution is published with a three-hour delay, making difficult to prevent high levels. airVLC is an application that predicts the levels of pollution by employing traffic and meteorological data in real-time, providing different maps to show the level of pollution in each point of the city by using wind-based spatial interpolation techniques.


Master Thesis of Lidia Contreras-Ochando:

  • Best Masther Thesis award

2016

Project presented at the III Hackathon Telefónica HackForGood 2015:

  • 3rd local prize
  • Telefónica's Think Big prize
  • Best challenge prize

Lidia Contreras-Ochando, Francisco Contreras-Ochando, Cristina I. Font-Julián
2015

Transparency Science [website][PDF]

The aim of this project is to provide reliable information about public investment in science. Data related to the Spanish state budget is collected from several public open sources. Once processed, the knowledge extracted is shown in such a way to facilitate maximum understanding of citizens, as well as their participation by means of voting systems.


Project presented at the iConference 2015:

  • Finalist: Social Media Expo 2015

Lidia Contreras-Ochando, Cristina I. Font-Julián, Paulina Morillo, Diego Vallejo
2015

contact info

Universitat Politècnica de València (UPV)
Department of Computer Systems and Computation (DSIC)
Building 1F (Room L201)
Camino de Vera s/n, 46020, Valencia
Spain

Current address:
Katholieke Universiteit Leuven (KU Leuven)
Department of Computer Science
Celestijnenlaan 200A (Room 02.65)
3001 Heverlee
Belgium