Helge Holzmann

I am a software developer, researcher and consultant based in Hannover, Germany. I hold a PhD in Computer Science and work as a Web Data Engineer for the Internet Archive. My focus is on the effective and efficient use of web archives and related topics, such as big data processing and information retrieval.

Publications

2022

H. Holzmann, N. Ruest, J. Bailey, A. Dempsey, S. Fritz, P. Lee and I. Milligan. ABCDEF: the 6 key features behind scalable, multi-tenant web archive processing with ARCH: archive, big data, concurrent, distributed, efficient, flexible. 22nd ACM/IEEE Joint Conference on Digital Libraries (JCDL). Cologne, Germany. June 2022. best paper nominee

H. Holzmann, N. Ruest, J. Bailey, A. Dempsey, S. Fritz, I. Milligan and K. Willis. Arch-It. Workshop on Web Archiving and Digital Libraries (WADL). Virtual. June 2022.

2021

book H. Holzmann and W. Nejdl. A Holistic View on Web Archives. The Past Web: Exploring Web Archives. 2021. chapter

2019

S. Wildemann and H. Holzmann. Towards Temporal URI Collections for Named Entities. 19th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL). Urbana-Champaign, Illinois, USA. June 2019.

demo S. Wildemann and H. Holzmann. Tempurion: A Collaborative Temporal URI Collection for Named Entities. 19th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL). Urbana-Champaign, Illinois, USA. June 2019.

phd thesis H. Holzmann. Concepts and Tools for the Effective and Efficient Use of Web Archives. L3S Research Center @ Leibniz University of Hannover (L3S @ LUH). Hannover, Germany. February 2019.

2018

H. Holzmann, A. Anand and M. Khosla. Delusive PageRank in Incomplete Graphs. COMPLEX NETWORKS. Cambridge, UK. December 2018.

poster A. Hoppe, J. Hagen, H. Holzmann, G. Kniesel and R. Ewerth. An Analytics Tool for Exploring Scientific Software and Related Publications. 22nd International Conference on Theory and Practice of Digital Libraries (TPDL). Porto, Portugal. September 2018.

H. Holzmann, A. Anand and M. Khosla. What the HAK? Estimating Ranking Deviations in Incomplete Graphs. 14th International Workshop on Mining and Learning with Graphs @ 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (MLG @ KDD). London, UK. August 2018.

journal P. Fafalios, H. Holzmann, V. Kasturia and W. Nejdl. Building and Querying Semantic Layers for Web Archives (Extended Version). International Journal on Digital Libraries (IJDL). July 2018.

short H. Holzmann and M. Runnwerth. Micro Archives as Rich Digital Object Representations. 10th International ACM Conference on Web Science (WebSci). Amsterdam, Netherlands. May 2018.

2017

short H. Holzmann, E. Novak Gustainis and V. Goel. Universal Distant Reading through Metadata Proxies with ArchiveSpark. 5th IEEE International Conference on Big Data (BigData). Boston, MA, USA. December 2017.

H. Holzmann, W. Nejdl and A. Anand. Exploring Web Archives Through Temporal Anchor Texts. 7th International ACM Conference on Web Science (WebSci). Troy, NY, USA. June 2017.

P. Fafalios, H. Holzmann, V. Kasturia and W. Nejdl. Building and Querying Semantic Layers for Web Archives. 17th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL). Toronto, Ontario, Canada. June 2017. best paper nominee

H. Holzmann and T. Risse. Accessing Web Archives from Different Perspectives with Potential Synergies. 2nd International Conference on Web Archives / Web Archiving Week (RESAW/IIPC). London, UK. June 2017.

2016

H. Holzmann, W. Sperber and M. Runnwerth. Archiving Software Surrogates on the Web for Future Reference. 20th International Conference on Theory and Practice of Digital Libraries (TPDL). Hannover, Germany. September 2016. among top10 best papers

short H. Holzmann, W. Nejdl and A. Anand. On the Applicability of Delicious for Temporal Search on Web Archives. 39th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR). Pisa, Italy. July 2016.

H. Holzmann, M. Runnwerth and W. Sperber. Linking Mathematical Software in Web Archives. 5th International Congress on Mathematical Software (ICMS). Berlin, Germany. July 2016.

H. Holzmann, V. Goel and A. Anand. ArchiveSpark: Efficient Web Archive Access, Extraction and Derivation. 16th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL). Newark, New Jersey, USA. June 2016. best paper nominee

H. Holzmann, W. Nejdl and A. Anand. The Dawn of Today's Popular Domains: A Study of the Archived German Web over 18 Years. 16th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL). Newark, New Jersey, USA. June 2016.

demo H. Holzmann and A. Anand. Tempas: Temporal Archive Search Based on Tags. 25th International Conference Companion on World Wide Web (WWW). Montreal, Quebec, Canada. April 2016.

2015

T. Souza, E. Demidova, T. Risse, H. Holzmann, G. Gossen and J. Szymanski. Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives. 1st COST Action IC1302 International KEYSTONE Conference (IKC). Coimbra, Portugal. September 2015.

journal H. Holzmann, N. Tahmasebi and T. Risse. Named Entity Evolution Recognition on the Blogosphere. International Journal on Digital Libraries, Volume 15, Issue 2 (IJDL). April 2015.

2014

H. Holzmann and T. Risse. Insights into Entity Name Evolution on Wikipedia. 15th International Conference on Web Information Systems Engineering (WISE). Thessaloniki, Greece. October 2014.

demo H. Holzmann and T. Risse. Extraction of Evolution Descriptions from the Web. 14th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL / DL). London, UK. September 2014.

journal E. Demidova, N. Barbieri, S. Dietze, A. Funk, H. Holzmann, D. Maynard, N. Papailiou, W. Peters, T. Risse and D. Spiliotopoulos. Analysing and Enriching Focused Semantic Web Archives for Parliament Applications. Future Internet, Volume 6, Issue 3. July 2014.

poster H. Holzmann and T. Risse. Named Entity Evolution Analysis on Wikipedia. 6th International ACM Conference on Web Science (WebSci). Bloomington, IN, USA. June 2014.

journal R. Pham, H. Holzmann, K. Schneider and C. Brüggemann. Tailoring video recording to support efficient GUI testing and debugging. Software Quality Journal, Volume 22, Issue 2. June 2014.

2013

H. Holzmann, N. Tahmasebi and T. Risse. BlogNEER: Applying Named Entity Evolution Recognition on the Blogosphere. 3rd International Workshop on Semantic Digital Archives @ 17th International Conference on Theory and Practice of Digital Libraries (SDA @ TPDL). Valletta, Malta. September 2013.

2012

N. Tahmasebi, G. Gossen, N. Kanhabua, H. Holzmann and T. Risse. NEER: An Unsupervised Method for Named Entity Evolution Recognition. 24th International Conference on Computational Linguistics (COLING). Mumbai, India. December 2012.

demo H. Holzmann, G. Gossen and N. Tahmasebi. fokas: Formerly Known As - A Search Engine Incorporating Named Entity Evolution. 24th International Conference on Computational Linguistics (COLING). Mumbai, India. December 2012.

R. Pham, H. Holzmann, K. Schneider and C. Brüggemann. Beyond plain Video-Recording of GUI-Tests - Linking Test Case Instructions with Visual Response Documentation. 7th IEEE/ACM International Workshop on Automation of Software Test @ 34th International Conference on Software Engineering (AST @ ICSE). Zurich, Switzerland. June 2012.

