Mapping and Reifying the Software Documentation Landscape

Faculty of Informatics - Academic Studies Administration

Date: 18 March 2025 / 15:30 - 18:30

USI East Campus, Room D0.02

You are cordially invited to attend the PhD Dissertation Defence of Marco Raglianti on Tuesday 18 March 2025 at 15:30 in room D0.02.

Abstract:
Every software system (ideally) comes with one or more forms of documentation, a fundamental asset of every software project. Documentation is deemed useful for different stages of the software lifecycle (e.g., design, implementation, maintenance) and is created and maintained by different stakeholders for different target audiences. In all this heterogeneity, multiple platforms (e.g., instant messaging, Q&A websites, social networks) host and contribute a substantial amount of documentation. Developers resort to Discord, StackOverflow, and a mix of Google, emails, and Tweets to "get things done". We refer to this evolving heterogeneous collection of information sources as the documentation landscape of a software system. A systematic approach to discover, classify, and integrate these sources in a unifying framework is fundamental to understand the paradigm shifts and complex phenomena affecting modern software documentation. To look at software documentation from a new perspective, we propose an approach to map and reify the documentation landscape of a software system. We identify and retrieve the sources of documentation deemed relevant by developers and investigate some of the many forms that documentation sources can take. We provide a holistic interpretation of the software documentation landscape and the phenomena determining its evolution. Traditional documentation is becoming a thing of the past, drifting towards fast (but also extremely volatile and noisy) communication channels, hosts of valuable chunks of the modern software documentation landscape. We complement our approach with an evolutionary analysis that explores in detail software documentation in instant messaging applications and UML artifacts. Through a mix of object oriented domain modeling and software visualization techniques, we characterize documentation sources according to form and content, analyzing their interplay. Finally, we discuss the current limitations and the potential future of the software documentation landscape, highlighting the need of a machine-readable specification to fully automate our mapping approach.

Dissertation Committee:
- Prof. Michele Lanza, Università della Svizzera italiana, Switzerland (Research Advisor)
- Prof. Gabriele Bavota, Università della Svizzera italiana, Switzerland (Internal Member)
- Prof. Carlo Alberto Furia, Università della Svizzera italiana, Switzerland (Internal Member)
- Prof. Gregorio Robles, Universidad Rey Juan Carlos, Spain (External Member)
- Prof. Federica Sarro, University College London, UK (External Member)
- Prof. Christoph Treude, Singapore Management University, Singapore (External Member)