An Analysis of XML Compression Efficiency
Document Type
Article
Publication Date
10-10-2024
Abstract
XML simplifies data exchange among heterogeneous computers, but it is notoriously verbose, which has prompted the development of many XML-specific compressors and binary formats. We present an XML test corpus and a combined efficiency metric that integrates compression ratio and execution speed. We use this corpus and linear regression to assess 14 general-purpose and XML-specific compressors against the proposed metric. We also identify key factors to consider when selecting a compressor. Our results show that XMill or WBXML may be useful in some instances, but a general-purpose compressor is often the best choice.
ACM classes: E.4; H.1.1
DOI
arXiv:2410.07603
Source Publication
arXiv.org [cs.DB]
Recommended Citation
Augeri, C. J., Mullins, B. E., Baird, L. C., III, Bulutoglu, D. A., & Baldwin, R. O. (2024). An Analysis of XML Compression Efficiency (No. arXiv:2410.07603). arXiv. https://doi.org/10.48550/arXiv.2410.07603
arXiv:2410.07603 [cs.DB]
Comments
The "Link to Full Text" on this page opens or saves the PDF of the paper, hosted in the arXiv.org e-print repository. "cs.DB" refers to the Databases collection in arXiv.