Data Science Education
Unravelling the Skill Sets of Data Scientists: A Text Mining Analysis of Dutch University Master Programs in Data Science and Artificial Intelligence | |
zsuzsa bakk |
Functional Data Analysis
A quantile extension to functional PCA | |
Alvaro Mendez Civieta |
Applying classification methods for multivariate functional data | |
Tomasz Górecki, Mirosław Krzyśko, Waldemar Wołyński |
Model-Based Clustering
Gaussian mixture models for changepoint detection | |
Sanjeena Subedi, Utkarsh Dang |
Clustering of human gut microbiome data using the finite mixture of generalized Dirichlet-multinomial models | |
Xiaoke Qin |
Mixture Multigroup Structural Equation Modeling: Comparing Structural Relations Across Many Groups | |
Andres Felipe Perez Alonso, Yves Rosseel, Jeroen Vermunt, Kim De Roover |
Finite Mixture Models for an underlying Beta distribution with an application to COVID-19 data | |
Jang Schiltz, Cédric Noel |
Drift-switching local level models for time series segmentation | |
Allou SAME |
A Clustering Procedure for Three-Way RNA Sequencing Data Using Data Transformations and Matrix-Variate Gaussian Mixture Models | |
Theresa Scharl |
A multivariate functional data clustering method using parsimonious cluster weighted models | |
Cristina Adela Anton |
Mixtures of Quantile-based Factor Analyzers | |
Edoardo Redivo |
Optimization in Classification and Clustering
Machine Learning-Based Classification and Prediction to Assess Corrosion Degradation in Mining Pipelines | |
Kalidou Moussa Sow, Nadia GHAZALLI |
Towards Topologically Diverse Probabilistic Planning Benchmarks: Synthetic Domain Generation for Markov Decision Processes | |
Jaël Champagne Gareau, Éric Beaudry, Vladimir Makarenkov |
Spatial Data Analysis
Spatial Agent-Based Model for Aedes Aegypti mosquitoes in the urban area in Arica (Chile) | |
Diana Marcela Martinez, Kerlyns Martínez, Daira Velandia, Débora Buendía, Scarlett Lever, María Elízabeth Guerra, Ximena Collao, Rodrigo Salas |
Text Mining
Combining Topic Modeling and Word Embedding to Predict Match Outcomes in Association Football | |
Sourav Adhikari |
Visualization and Clustering with Projective Techniques | |
Stephen L. France |
AI for Government: Methods and Data Analysis
Unsupervised Detection of Anomaly in Public Procurement Processes | |
Jose Pablo Arroyo-Castro, Shu Wei Chou-Chen |
Big Data
Understanding omics links behind glioma heterogeneity: a network and clustering approach | |
Marta B Lopes |
Clustering, Classification and Discrimination
A Deterministic Information Bottleneck Method for Clustering Mixed-Type Data | |
Efthymios Costa, Ioanna Papatsouma, Angelos Markos |
Model-based bi-clustering using multivariate Poisson-lognormal with general block-diagonal covariance matrix and its applications Submitted to IFCS 2024 Book of Abstracts | |
Caitlin Kral |
Clustering for High-Dimensional, Nested Data with Categorical Outcomes Using a Generalized Linear Mixed Effects Model with Simultaneous Variable Selection | |
Samantha Manning |
Weighted Consensus Clustering for Unbiased Feature Importance in Random Forests | |
Ndèye NIANG |
Randomly perturbed random forests | |
Laura Anderlucci, Angela Montanari |
A toolbox for clustering ordinal data in the presence of missing values | |
Lena Ortega Menjivar |
Multimodal Emotion Recognition: A comparative study | |
Ayemn Gondech, Hélène Tran, Issam Falih, Xavier Goblet, Engelbert Mephu Nguifo |
Two clustering strategies for structural composite models with PLS-SEM | |
Véronique Cariou |
P-value Adjusted Selected Tree Ensemble | |
Joshua Pooley, Berthold Lausen, Osama Mahmoud, Henrik Nordmark, Ivan Schuch |
Data Science
Riemannian Statistics for Any Type of Data | |
Oldemar Rodriguez Rojas |
Dimension Reduction
Optimal penalized sparse PCA | |
Rosember Isidoro Guerra Urzola |
Multiblock Regularized Least-squares Latent Variable Method | |
Thu Tra Le |
Generative Artificial Intelligence, Explainable Artificial Intelligence (XAI)
Innovating the banking with machine learning: Credit Score for MSMEs | |
Tatiana Quirós Muñoz, Álvaro Guevara Villalobos |
Image Analysis and Computer Vision
Integration of Deep Learning and Marketing Research for Brand Confusion Prediction and Visual ad Analysis | |
Atsuho Nakayama |
Machine Learning
Improving Employee Attrition with Data Analysis and Machine Learning | |
Sergio Ramirez Rodriguez, Alvaro Guevara Villalobos |
Machine learning-driven COVID-19 early triage and large-scale testing strategies based on the 2021 Costa Rican Actualidades survey | |
Carlos Pasquier, Maikol Solís, Vivian Vilchez, Santiago Núñez-Corrales |
Predicting soil bacterial and fungal communities at different taxonomic levels using machine learning | |
Vladimir Makarenkov, Zahia Aouabed, Mohamed Achraf Bouaoune, Mohamed Hijri |
Modelling High-Dimensional and Complex Data
UMAP projections and the survival of empty space: A geometric approach to high-dimensional data | |
Maikol Solís, Alberto Hernández |
Statistical and Econometric Methods
Optimization Strategies for Bioprocess Parameterization: A Comparative Evaluation | |
Matthias Medl |
Time Series Analysis
Predicting Air Pollution in Beijing, China Using Chemical, and Climate Variables | |
Joshua Isaac Cervantes Artavia, Moisés De Jesús Monge Cordonero, Daniel Josué Sabater Guzmán |
Modelling clusters in network time series with an application to presidential elections in the USA | |
Guy Nason, Daniel Salnikov, Mario Cortina-Borja |
|*|Advances in supervised classification
A comparison of multivariate mixed models and generalized estimation equations models for discrimination in multivariate longitudinal data | |
Tolulope Sajobi |
Statistics in the Knowledge Economy | |
David Banks |
A Gene Selection Method for Classification with Three Classes Using Proportional Overlapping Scores | |
Anusa Suwanwong, Andrew Harrison, Osama Mahmoud |
Scalable conic optimization for feature selection in linear SVMs with cardinality control | |
Immanuel M. Bomze, Federico D'Onofrio, Bo Peng, Laura Palagi |
|*|Classification Methods for Large Datasets (Organized by Aurea Grané)
Robust distance-based generalized linear models: A new tool for classification | |
Eva Boj, Aurea Grane, Agustín Mayo-Íscar |
High-dimensional survival analysis: exploring Cox regression with lasso and adaptive lasso penalties | |
Pilar González-Barquero |
Using polynomials to explain classification outputs from neural networks | |
Pablo Morala |
|*|Clustering, Classification and Discrimination (Organized by Aurea Grané)
A new distance for categorical data with moderate association | |
Aurea Grané, Silvia Salini, Gabriele Infante |
Unsupervised methods for the creation of orthonormal bases in compositional data: R-mode clustering | |
Jose Antonio Martin Fernandez |
A multivariate approach for clustering functional data in one and multiple dimensions | |
Belén Pulido |
Analysis of seawater nutrient concentrations to assess Submarine Groundwater Discharge along the Catalan coast (NW Mediterranean): a Compositional Data Analysis Approach | |
M.I. Ortego |
Fuzzy Clustering of Attributed Networks | |
lazhar labiod, Mohamed Nadif |
An efficient multicore CPU implementation of the DatabionicSwarm | |
Quirin Stier |
|*|Data Science in Economics, Finance and Management (Organized by K. Jajuga)
On the Vapnik-Chervonenkis Dimension and Learnability of the Hurwicz Decision Criterion | |
Manuel Nunez, Mark Schneider |
Robust estimation of the range-based GARCH model: Forecasting volatility, value at risk and expected shortfall of cryptocurrencies | |
Marta Małecka |
A Spectral Approach to Evaluating VaR Forecasts: Stock Market Evidence from the Subprime Mortgage Crisis, through COVID-19, to the Russo-Ukrainian War | |
Marta Malecka, Radosław Pietrzyk |
Green bond yield determination with the use of machine learning methods. Comparison with conventional bonds | |
Katarzyna Ewa Kuziak, Klaudia Kaczmarczyk, Caner Colak |
|*|Data Science in Social and Political Research
Mapping Electoral Behavior and Political Competition: A Comparative Analytical Framework for Voter Typologies and Political Discourses | |
Georgia Panagiotidou, Theodore Chadjipadelis |
Candidates, Parties, Issues and the Political Marketing Strategies: A Comparative Analysis on political competition in Greece | |
VASILIKI BOURANTA, GEORGIA PANAGIOTIDOU, THEODORE CHADJIPADELIS |
Gender Bias Mitigation in a Credit Scoring Model | |
Ricardo Corrales-Barquero |
Crime in Mexico: an original Data Analysisapproach | |
Maria Teresa Guerrero-San Vicente, Carlos Cuevas-Covarrubias |
|*|Functional Data Analysis (Organized by Prof. Rosanna Verde)
A new metric to classify B cell lineage tree | |
Nadia Tahiri |
|*|Health Data Science
TabText: A Flexible and Contextual Approach to Tabular Data Representation | |
Kimberly Villalobos Carballo |
|*|Modeling Multivariate Data (Organized by Prof. Anuradha Roy)
Classifying multivariate observations in data sets with asymmetric features and outlying observations | |
Brian Franczak |
Multiblock Methods for Learning Structural Equation Models: An Overview | |
Alba Martinez-Ruiz |
A Comparison of Multivariate Mixed Models and Generalized Estimation Equations Models for Discrimination in Multivariate Longitudinal Data | |
Tolulope Sajobi |
|*|Multidimensional data visualization (Organized by Johané Nienkemper & Sugnet Lubbe)
Reduced Rank Regression with Mixed Predictors and Mixed Responses | |
Mark de Rooij |
Model Selection for Linear Regression Under Data Aggregation | |
Pieter C. Schoonees |
Nearest neighbors for mixed type data: an inter-dependency based approach | |
Alfonso Iodice D'Enza, Carlo Cavicchia, Michel van de Velden, Angelos Markos |
|*|Pattern Recognition
Pattern Recognition for Mexican Household Power Demand Time Series | |
José Asse Amiga |
|*|Statistical Learning and Data Mining
Bridge the gap between Gradual Patterns and Statistical Correlations | |
Engelbert MEPHU NGUIFO |
|*|Symbolic Data Analysis (Organized by Prof. Rosanna Verde)
Distributional-based Partitioning with Copulas | |
Wenhao Pan, Lynne Billard |
A fuzzy clustering algorithm with entropy regularization for interval-valued data | |
Francisco de Assis Tenorio de Carvalho |
Symbolic Data Analysis Framework for Recommendation Systems: SDA-RecSys | |
Pushya Chaparala, Nagabhushan P |
Network analysis approach to the analysis of event sequences | |
Vladimir Batagelj, Anuška Ferligoj |
Spatio-temporal hierarchical clustering of interval time series with application to suicide rates in Europe | |
Raffaele Mattera, Philip Hans Franses |
Principal Components Analysis of Histogram-valued Data: Set Theory Approach | |
Jorge Arce, Oldemar Rodriguez Rojas |
Expected Size of Random Fuzzy Concep Lattices | |
Richard EMILION |
A Robust approach of the Clusterwise Regression method for distributional data | |
Rosanna Verde, Gianmarco Borrata, Antonio Balzanella, Francisco de A. T. de Carvalho |
Quality Measures for Clusterwise Regression | |
Paula Brito |
A Robust approach of the Clusterwise Regression method for distributional data | |
Rosanna Verde, Gianmarco Borrata, Antonio Balzanella, Francisco de A. T. de Carvalho |
Hypothesis Testing of Mean Interval for p-dimensional Interval-valued Data | |
Anuradha Roy, Fernando Montes |