GB/Z 43768-2024 Information and documentation—Statistics and quality issues for web archiving
GB/Z 43768-2024 Information and documentation—Statistics and quality issues for web archiving
Basic Information
Scope
This document defines statistical data, terminology, and quality standards for web archiving. It takes into account the needs and practices of numerous organizations such as libraries, archives, museums, research centers, and cultural heritage foundations. This document is intended for experts directly involved in web archiving, typically a team consisting of decision-makers, engineers, and preservation managers from web archiving institutions. It is also useful for funding agencies and stakeholders of web archiving institutions. The professional terminology used in this document aims to reflect the broad interests and expertise of its audience, striking a balance between computer science, management, and library science. This document does not apply to the management of academic and commercial electronic resources such as e-journals, e-newspapers, or e-books, which are typically stored and processed separately using different management systems. Although these resources are considered internet resources, they are not discussed as specific content streams of web archiving in this document. Some organizations also collect electronic documents distributed via the internet, such as those in publishers' electronic repositories and storage systems, which are also beyond the scope of this document. The principles and technologies used for such collection differ significantly from those of web archiving, so the statistical data and quality indicators in this document may not necessarily apply to them. This document focuses on the principles and methods of web archiving and does not include other approaches to collecting internet resources. In fact, some internet resources, particularly those not disseminated online (such as communications sent via email), are not collected using web archiving technologies but rather through other methods, which are also beyond the scope of this document.