XMill: an Efficient Compressor for XML Data

University of Pennsylvania Technical Report MSCIS-99-26 (1999)

Abstract
We describe a tool for compressing XML data, called XMill, that usually achieves about twice the compression ratio of gzip at roughly the same speed. The intended applications are XML data exchange and archiving. XMill does not need schema information (such as a DTD or an XML-Schema), but can exploit hints about such a schema to further improve the compression ratio. XMill incorporates and combines existing compressors in order to compress heterogeneous XML data: it uses zlib, the library function for gzip, as well as a collection of datatype-specific compressors. XMill can be extended with new specialized compressors: this is useful in applications managing XML data with highly specialized data types, such as DNA sequences, images, etc. The paper presents a theoretical justification for the method used, the XMill architecture and implementation, a new language for expressing hints about the XML schema, and a series of experiments validating XMill on several real data sets.
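The idea of combining zlib with datatype-specific compressors can be illustrated with a small sketch. It is not XMill's implementation or file format: the grouping of text values by element tag, the function names (`compress_by_tag`, `delta_encode_ints`, `plain_encode`) and the `#structure` key are all assumptions made for illustration. The flat tag list it stores does not record nesting, so the output cannot be turned back into the original document.

```python
import zlib
import xml.etree.ElementTree as ET
from collections import defaultdict


def plain_encode(values):
    """Default encoder (assumed): join values with newlines, no transformation."""
    return "\n".join(values).encode("utf-8")


def delta_encode_ints(values):
    """Hypothetical specialized encoder: store integers as successive differences."""
    nums = [int(v) for v in values]
    deltas = [nums[0]] + [b - a for a, b in zip(nums, nums[1:])]
    return "\n".join(str(d) for d in deltas).encode("utf-8")


def compress_by_tag(xml_text, encoders=None):
    """Group text values by element tag and zlib-compress each group separately.

    `encoders` maps a tag name to a specialized encoder; tags without an
    entry fall back to `plain_encode`. This mapping stands in for the
    "hints" idea only loosely and is an assumption of this sketch.
    """
    encoders = encoders or {}
    root = ET.fromstring(xml_text)

    structure = []                  # tag names in document order
    containers = defaultdict(list)  # tag name -> list of text values

    for elem in root.iter():
        structure.append(elem.tag)
        text = (elem.text or "").strip()
        if text:
            containers[elem.tag].append(text)

    # '#' cannot appear in an XML name, so this key never collides with a tag.
    packed = {"#structure": zlib.compress(plain_encode(structure))}
    for tag, values in containers.items():
        encode = encoders.get(tag, plain_encode)
        packed[tag] = zlib.compress(encode(values))
    return packed


doc = (
    "<log>"
    "<e><id>1000</id><msg>ok</msg></e>"
    "<e><id>1001</id><msg>ok</msg></e>"
    "</log>"
)
packed = compress_by_tag(doc, {"id": delta_encode_ints})
```

Registering a new entry in `encoders` is how this sketch mimics extending the tool with a specialized compressor, for example one tailored to DNA sequences. Whether such grouping improves the compression ratio depends on the data; the sketch makes no claim about that.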