Follow
Dustin Lange
Dustin Lange
Stealth mode (ex Amazon)
Verified email at sent.com
Title
Cited by
Cited by
Year
Automating large-scale data quality verification
S Schelter, D Lange, P Schmidt, M Celikel, F Biessmann, A Grafberger
Proceedings of the VLDB Endowment 11 (12), 1781-1794, 2018
1622018
Probabilistic demand forecasting at scale
JH Böse, V Flunkert, J Gasthaus, T Januschowski, D Lange, D Salinas, ...
Proceedings of the VLDB Endowment 10 (12), 1694-1705, 2017
1112017
Extracting structured information from Wikipedia articles to populate infoboxes
D Lange, C Böhm, F Naumann
Proceedings of the 19th ACM international conference on Information and …, 2010
852010
" Deep" Learning for Missing Value Imputationin Tables with Non-numerical Data
F Biessmann, D Salinas, S Schelter, P Schmidt, D Lange
Proceedings of the 27th ACM International Conference on Information and …, 2018
732018
DataWig: Missing Value Imputation for Tables.
F Biessmann, T Rukat, P Schmidt, P Naidu, S Schelter, A Taptunov, ...
J. Mach. Learn. Res. 20 (175), 1-6, 2019
662019
Cross-lingual entity matching and infobox alignment in Wikipedia
D Rinser, D Lange, F Naumann
Information Systems 38 (6), 887-907, 2013
632013
Efficient similarity search in very large string sets
D Fenz, D Lange, A Rheinländer, F Naumann, U Leser
Scientific and Statistical Database Management: 24th International …, 2012
302012
Reach for gold: An annealing standard to evaluate duplicate detection results
T Vogel, A Heise, U Draisbach, D Lange, F Naumann
Journal of Data and Information Quality (JDIQ) 5 (1-2), 1-25, 2014
182014
Efficient Similarity Search: Arbitrary Similarity Measures, Arbitrary Composition
D Lange, F Naumann
Proceedings of the 20th ACM international conference on Information and …, 2011
182011
Frequency-aware similarity measures: why Arnold Schwarzenegger is always a duplicate
D Lange, F Naumann
Proceedings of the 20th ACM international conference on Information and …, 2011
152011
Automated data validation in machine learning systems
F Biessmann, J Golebiowski, T Rukat, D Lange, P Schmidt
122021
Differential data quality verification on partitioned data
S Schelter, S Grafberger, P Schmidt, T Rukat, M Kiessling, A Taptunov, ...
2019 IEEE 35th International Conference on Data Engineering (ICDE), 1940-1945, 2019
122019
Unit testing data with deequ
S Schelter, F Biessmann, D Lange, T Rukat, P Schmidt, S Seufert, ...
Proceedings of the 2019 International Conference on Management of Data, 1993 …, 2019
102019
Deequ-data quality validation for machine learning pipelines
S Schelter, P Schmidt, T Rukat, M Kiessling, A Taptunov, F Biessmann, ...
82018
Towards automated data quality management for machine learning
T Rukat, D Lange, S Schelter, F Biessmann
ML Ops Work. Conf. Mach. Learn. Syst, 1-3, 2020
72020
Towards automated ML model monitoring: Measure, improve and quantify data quality
T Rukat, D Lange, S Schelter, F Biessmann
52020
Cost-aware query planning for similarity search
D Lange, F Naumann
Information Systems 38 (4), 455-469, 2013
52013
An interpretable latent variable model for attribute applicability in the amazon catalogue
T Rukat, D Lange, C Archambeau
arXiv preprint arXiv:1712.00126, 2017
42017
Projektseminar „Similarity Search Algorithms “
D Lange, T Vogel, U Draisbach, F Naumann
Datenbank-Spektrum, 1-7, 2011
22011
Effective and efficient similarity search in databases
D Lange
Universität Potsdam, 2013
12013
The system can't perform the operation now. Try again later.
Articles 1–20