2024 |
Adilova, Linara; Andriushchenko, Maksym; Kamp, Michael; Fischer, Asja; Jaggi, Martin Layer-wise Linear Mode Connectivity (Proceedings Article) In: International Conference on Learning Representations (ICLR), Curran Associates, Inc, 2024. (Tags: deep learning, layer-wise, linear mode connectivity)
Abstract: Averaging neural network parameters is an intuitive method for fusing the knowledge of two independent models. It is most prominently used in federated learning. If models are averaged at the end of training, this can only lead to a well-performing model if the loss surface of interest is very particular, i.e., the loss at the exact midpoint between the two models needs to be sufficiently low. This is impossible to guarantee for the non-convex losses of state-of-the-art networks. For averaging models trained on vastly different datasets, it was proposed to average only the parameters of particular layers or combinations of layers, resulting in better-performing models. To get a better understanding of the effect of layer-wise averaging, we analyse the performance of the models that result from averaging single layers or groups of layers. Based on our empirical and theoretical investigation, we introduce a novel notion of layer-wise linear connectivity and show that deep networks do not have layer-wise barriers between them. We additionally analyse layer-wise personalization averaging and conjecture that, in a particular problem setup, all partial aggregations result in approximately the same performance. |
Yang, Fan; Bodic, Pierre Le; Kamp, Michael; Boley, Mario Orthogonal Gradient Boosting for Interpretable Additive Rule Ensembles (Proceedings Article) In: Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS), 2024. (Tags: complexity, explainability, interpretability, interpretable, machine learning, rule ensemble, rule mining, XAI)
Abstract: Gradient boosting of prediction rules is an efficient approach to learn potentially interpretable yet accurate probabilistic models. However, actual interpretability requires limiting the number and size of the generated rules, and existing boosting variants are not designed for this purpose. Though corrective boosting refits all rule weights in each iteration to minimise prediction risk, the included rule conditions tend to be sub-optimal, because commonly used objective functions fail to anticipate this refitting. Here, we address this issue with a new objective function that measures the angle between the risk gradient vector and the projection of the condition output vector onto the orthogonal complement of the already selected conditions. This approach correctly approximates the ideal update of adding the risk gradient itself to the model and favours the inclusion of more general and thus shorter rules. As we demonstrate using a wide range of prediction tasks, this significantly improves the comprehensibility/accuracy trade-off of the fitted ensemble. Additionally, we show how objective values for related rule conditions can be computed incrementally to avoid any substantial computational overhead of the new method. |
2023 |
Kamp, Michael; Fischer, Jonas; Vreeken, Jilles Federated Learning from Small Datasets (Proceedings Article) In: International Conference on Learning Representations (ICLR), 2023. (Tags: black-box, black-box parallelization, daisy, daisy-chaining, FedDC, federated learning, small, small datasets) |
Mian, Osman; Kaltenpoth, David; Kamp, Michael Nothing but Regrets - Privacy-Preserving Federated Causal Discovery (Proceedings Article) In: International Conference on Artificial Intelligence and Statistics (AISTATS), 2023. (Tags: causal discovery, causality, explainable, federated, federated causal discovery, federated learning, interpretable) |
Mian, Osman; Kamp, Michael; Vreeken, Jilles Information-Theoretic Causal Discovery and Intervention Detection over Multiple Environments (Proceedings Article) In: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2023. (Tags: causal discovery, causality, federated, federated causal discovery, federated learning, intervention) |
Adilova, Linara; Chen, Siming; Kamp, Michael Informed Novelty Detection in Sequential Data by Per-Cluster Modeling (Proceedings Article) In: ICML workshop on Artificial Intelligence & Human Computer Interaction, 2023. |
Adilova, Linara; Abourayya, Amr; Li, Jianning; Dada, Amin; Petzka, Henning; Egger, Jan; Kleesiek, Jens; Kamp, Michael FAM: Relative Flatness Aware Minimization (Proceedings Article) In: ICML Workshop on Topology, Algebra, and Geometry in Machine Learning, 2023. |
Adilova, Linara; Kamp, Michael; Andrienko, Gennady; Andrienko, Natalia Re-interpreting rules interpretability (Journal Article) In: International Journal of Data Science and Analytics, pp. 1–21, 2023. |
Li, Jianning; Ferreira, André; Puladi, Behrus; Alves, Victor; Kamp, Michael; Kim, Moon; Nensa, Felix; Kleesiek, Jens; Ahmadi, Seyed-Ahmad; Egger, Jan Open-source skull reconstruction with MONAI (Journal Article) In: SoftwareX, vol. 23, pp. 101432, 2023. |
2022 |
Abourayya, Amr; Kamp, Michael; Ayday, Erman AIMHI: Protecting Sensitive Data through Federated Co-Training (Workshop) 2022. (Tags: aimhi, co-training, deep learning, federated learning, privacy) |
Mian, Osman; Kaltenpoth, David; Kamp, Michael Regret-based Federated Causal Discovery (Proceedings Article) In: The KDD'22 Workshop on Causal Discovery, pp. 61–69, PMLR 2022. |
Li, Jianning; Ferreira, André; Puladi, Behrus; Alves, Victor; Kamp, Michael; Kim, Moon-Sung; Nensa, Felix; Kleesiek, Jens; Ahmadi, Seyed-Ahmad; Egger, Jan Open-Source Skull Reconstruction with MONAI (Journal Article) In: arXiv preprint arXiv:2211.14051, 2022. |
Wang, Junhong; Li, Yun; Zhou, Zhaoyu; Wang, Chengshun; Hou, Yijie; Zhang, Li; Xue, Xiangyang; Kamp, Michael; Zhang, Xiaolong; Chen, Siming When, Where and How does it fail? A Spatial-temporal Visual Analytics Approach for Interpretable Object Detection in Autonomous Driving (Journal Article) In: IEEE Transactions on Visualization and Computer Graphics, 2022. |
2021 |
Linsner, Florian; Adilova, Linara; Däubener, Sina; Kamp, Michael; Fischer, Asja Approaches to Uncertainty Quantification in Federated Deep Learning (Workshop) Machine Learning and Principles and Practice of Knowledge Discovery in Databases: International Workshops of ECML PKDD 2021, vol. 2, Springer, 2021. (Tags: federated learning, uncertainty) |
Li, Xiaoxiao; Jiang, Meirui; Zhang, Xiaofei; Kamp, Michael; Dou, Qi FedBN: Federated Learning on Non-IID Features via Local Batch Normalization (Proceedings Article) In: Proceedings of the 9th International Conference on Learning Representations (ICLR), 2021. (Tags: batch normalization, black-box parallelization, deep learning, federated learning)
Abstract: The emerging paradigm of federated learning (FL) strives to enable collaborative training of deep models on the network edge without centrally aggregating raw data, thereby improving data privacy. In most cases, the assumption of independent and identically distributed samples across local clients does not hold for federated learning setups. Under this setting, neural network training performance may vary significantly according to the data distribution, and training convergence may even be hurt. Most of the previous work has focused on a difference in the distribution of labels or client shifts. Unlike those settings, we address an important problem of FL, e.g., different scanners/sensors in medical imaging, different scenery distribution in autonomous driving (highway vs. city), where local clients store examples with different distributions compared to other clients, which we denote as feature shift non-iid. In this work, we propose an effective method that uses local batch normalization to alleviate the feature shift before averaging models. The resulting scheme, called FedBN, outperforms both classical FedAvg and the state-of-the-art for non-iid data (FedProx) in our extensive experiments. These empirical results are supported by a convergence analysis that shows in a simplified setting that FedBN has a faster convergence rate than FedAvg. Code is available at https://github.com/med-air/FedBN. |
Petzka, Henning; Kamp, Michael; Adilova, Linara; Sminchisescu, Cristian; Boley, Mario Relative flatness and generalization (Journal Article) In: Advances in Neural Information Processing Systems, vol. 34, pp. 18420–18432, 2021. |