Large Archive Storage Systems Bibliography

Home
[littman-connaway-duke]
Justin Littman and Lynn Silipigni Connaway.A Circulation Analysis of Print Books and e-Books in an Academic Research Library.Library Resources and Technical Services.2004. [website]
[rosellicomparison]
Drew Roselli and Jacob R. Lorch and Thomas E. Anderson.A Comparison of File System Workloads.Proceedings of the 2000 USENIX Annual Technical Conference (USENIX-00).2000. [website]
[ smith96comparison]
Keith A. Smith and Margo I. Seltzer.A Comparison of {FFS} Disk Allocation Policies.{USENIX} Annual Technical Conference.1996.[website]
[oai:CiteSeerPSU:88493]
David Hawking and Peter Bailey.A Parallel Architecture for Query Processing Over A Terabyte of Text.1996. [website]
[oai:arXiv.org:cs/0403021]
Tom Barclay and Wyman Chong and Jim Gray.A Quick Look at {SATA} Disk Performance.2004.[website]
[ mitzenmacher-brief]
M. Mitzenmacher.A brief history of generative models for power law and lognormal distributions.[website]
[jones00transaction]
Steve Jones and Sally Jo Cunningham and Rodger J. McNab and Stefan J. Boddie.A transaction log analysis of a digital library.Int. j. on Digital Libraries.2000. [website]
[DBLP:conf/pakdd/2004]
Advances in Knowledge Discovery and Data Mining, 8th Pacific-Asia Conference, PAKDD 2004, Sydney, Australia, May 26-28, 2004, Proceedings.2004.
[jones98analysis]
Steve Jones and Sally Jo Cunningham and Rodger J. McNab.An Analysis of Usage of a Digital Library.European Conference on Digital Libraries.1998. [website]
[oai:CiteSeerPSU:586848]
Ann Chervenak-drapeau and David A. Patterson and Ethan Miller and Joel Fine and Y H. Katz.An Approach to Cost-Effective Terabyte Memory Systems.1992. [website]
[ grandi03annotated]
F. Grandi.An annotated bibliography on temporal and evolution aspects in the world wide web.2003. [website]
[oai:CiteSeerPSU:282431]
Sally Jo Cunningham.Applications for Bibliometric Research in the Emerging Digital Libraries.1998. [website]
[ARCFileFormat]
Mike Burner and Brewster Kahle.Arc File Format.retrieved from http://www.archive.org/web/researcher/ArcFileFormat.php.1996. [website]
[Crespo98]
Arturo Crespo and Hector Garcia-Molina.Archival Storage for Digital Libraries.DL'98: Proceedings of the 3rd ACM International Conference on Digital Libraries.1998. [website]
[oai:CiteSeerPSU:306670]
Ian Willers and Koen Holtman and Peter Van Der Stok.Automatic Reclustering of Objects in Very Large Databases for High Energy Physics.1998. [website]
[computer-benchmarks]
DA. Forsyth.Benchmarks for storage and retrieval in multimedia databases.Proceedings of Spie - The International Society for Optical Engineering.2001.[website]
[Powers94]
Alan K. Powers.Beyond a Terabyte File System.Proceedings of the Spring Cray Users Group Conference (33rd Spring CUG'94).1994.
[oai:CiteSeerPSU:647412]
Frank Havemann.Bibliometric Indicators and their Use for Research Evaluation:.2003. [website]
[Hawkins2001]
Donald T Hawkins.Bibliometrics of Electronic Journals in Information Science..Information Research.2001. [website]
[ogle95chabot]
Virginia E. Ogle and Michael Stonebraker.Chabot: Retrieval from a Relational Database of Images.IEEE Computer.1995. [website]
[oai:pubmedcentral.gov:225638]
J F Burnham and B S Shearer and J C Wall.Combining new technologies for effective collection development: a bibliometric study using {CD}-{ROM} and a database management program..2003. [website]
[DSpace::Year]
MacKenzie Smith and Richard Rodgers and Julie Walker and Robery Tansley.DSpace: A Year in the Life of an Open Source Digital Repository System.Research and Advanced Technology for Digital Libraries: 8th European Conference, ECDL 2004.2004. [website]
[santry99deciding]
Douglas S. Santry and Michael J. Feeley and Norman C. Hutchinson and Alistair C. Veitch and Ross W. Carton and Jacob Ofir.Deciding when to forget in the Elephant file system.Symposium on Operating Systems Principles.1999. [website]
[oai:DLIST.OAI2:33]
Dr David C Nagel and Dr. Ching-chih Chen and Dr. James N. Gray and Dr. Robert E. Kahn and Dr. Raj Reddy.Digital Libraries: Universal Access to Human Knowledge.2001. [website]
[Wiederhold:1995:DLV]
Gio Wiederhold.Digital libraries, value, and productivity.Communications of the ACM.1995. [website]
[Turbyfill88]
Carolyn Turbyfill.Disk Performance and Access Patterns for Mixed Database Workloads.IEEE CS Technical Com. on Database Engineering Bulletin.1988.
[mitzenmacher02dynamic]
M. Mitzenmacher.Dynamic models for file sizes and double pareto distributions.2002. [website]
[ mitzenmacher02dynamic]
M. Mitzenmacher.Dynamic models for file sizes and double pareto distributions.2002.[website]
[oai:CiteSeerPSU:648302]
Christos Karamanolis and Lawrence L. You.Evaluation of Efficient Archival Storage Techniques.2004. [website]
[DBLP:conf/pakdd/RuttP04]
Benjamin Rutt and Srinivasan Parthasarathy.Exploiting Recurring Usage Patterns to Enhance Filesystem and Memory Subsystem Performance..PAKDD.2004.[website]
[301281]
Edith Cohen and Haim Kaplan.Exploiting regularities in Web traffic patterns for cache replacement.STOC '99: Proceedings of the thirty-first annual ACM symposium on Theory of computing.1999.[website]
[frolund03fab]
S. Frolund and A. Merchant and Y. Saito and S. Spence and A. Veitch.FAB: Enterprise storage systems on a shoestring.Proc. 9th Workshop on Hot Topics in Operating Systems.2003. [website]
[SmithSeltzerTR]
Kevin A. Smith and Margo Seltzer.File Layout and File System Performance.Harvard Computer Science.1994.[website]
[anderson01hippodrome]
E. Anderson and M. Hobbs and K. Keeton and S. Spence and M. Uysal and A. Veitch.Hippodrome: running circles around storage administration.2001. [website]
[DBLP:journals/jasis/Bookstein97]
Abraham Bookstein.Informetric Distributions. III. Ambiguity and Randomness..JASIS.1997.
[DBLP:journals/jasis/Bookstein90]
Abraham Bookstein.Informetric distributions, part I: Unified overview..JASIS.1990.
[DBLP:journals/jasis/Bookstein90a]
Abraham Bookstein.Informetric distributions, part II: Resilience to ambiguity..JASIS.1990.
[oai:eprints.rclis.org:3297]
Leo Egghe and Ronald Rousseau.Introduction to Informetrics : quantitative methods in library, documentation and information science.1990. [website]
[oai:CiteSeerPSU:678000]
Anastassia Ailamaki and Gregory R. Ganger and Jiri Schindler.Matching Database Access Patterns to Storage Characteristics.2003. [website]
[oai:eprints.rclis.org:3610]
Jakob Voss.Measuring Wikipedia.2005. [website]
[crespo00modeling]
Arturo Crespo and Hector Garcia-Molina.Modeling Archival Repositories for Digital Libraries.Lecture Notes in Computer Science.2000. [website]
[oai:arXiv.org:physics/0411188]
D. A. Sanders and L. M. Cremaldi and V. Eschenburg and R. Godang and M. D. Joy and D. J. Summers and D. L. Petravick.Multi-Terabyte {EIDE} Disk Arrays running Linux {RAID5}.2004. [website]
[ge-petabyte]
To Ra Ge.Petabyte File Systems Based on Tertiary Storage.1996. [website]
[silverstein-predicting]
Craig Silverstein and Stuart M. Shieber.Predicting Book Use for Off-Site Storage. [website]
[recker96predicting]
Margaret M. Recker and James E. Pitkow.Predicting document access in large multimedia repositories.ACM Transactions on Computer-Human Interaction.1996. [website]
[Waugh00]
Andrew Waugh and Ross Wilkinson and Brendan Hills and Jon Dell'oro.Preserving Digital Information Forever.DL'00: Proceedings of the 5th ACM International Conference on Digital Libraries.2000. [website]
[320095]
Chat-Yu Lam and Stuart E. Madnick.Propeties of storage hierarchy systems with multiple page sizes and redundant data.ACM Trans. Database Syst..1979. [website]
[oai:CiteSeerPSU:505068]
C. N. Lawrence and D. J. Summers and D. L. Petravick and L. M. Cremaldi and V. Eschenburg.Redundant Arrays of {IDE} Drives.2001. [website]
[oai:CiteSeerPSU:540927]
Clement H. C. Leung and Philip K. C. Tse.Retrieving Multimedia Objects From Hierarchical Storage Systems.2001. [website]
[oai:CiteSeerPSU:314563]
Alok Choudhary and Sachin More.Scheduling Queries on Tape-resident Data.2000. [website]
[oai:arXiv.org:cs/0502012]
Peter Kukol and Jim Gray.Sequential File Programming Patterns and Performance with .{NET}.2005. [website]
[oai:CiteSeerPSU:99678]
Michael Stonebraker and Sunita Sarawagi.Single Query Optimization for Tertiary Memory.1994. [website]
[oai:CiteSeerPSU:538438]
Michelle L. Butler.Storage Issues at {NCSA}: How to get file systems going wide and fast within and out of large scale Linux cluster systems.2002. [website]
[MSR-TR-2002-54]
Jim Gray and Wyman Chong and Tom Barclay and Alex Szalay and Jan vandenBerg.TeraScale SneakerNet: Using Inexpensive Disks for Backup, Archiving, and Data Exchange.Microsoft Research (MSR).2002.
[MSR-TR-2004-107]
Tom Barclay and Jim Gray and Wyman Chong.TerraServer Bricks -- {A} High Availability Cluster Alternative.Microsoft Research (MSR).2004.
[MSR-TR-2004-67]
Tom Barclay and Jim Gray.TerraServer Cluster and {SAN} Experience.Microsoft Research (MSR).2004.
[UCB//CSD-98-989]
Nisha Talagala and Satoshi Asami and Thomas Anderson and David Patterson.Tertiary Disk: Large Scale Distributed Storage.University of California, Berkeley.1998.
[ olson93design]
M. A. Olson.The Design and Implementation of the {Inversion} File System.Proceedings of the {USENIX} Winter 1993 Technical Conference.1993. [website]
[DBLP:conf/colis/Hayes99]
Robert M. Hayes.The Economics of Digital Libraries..Digital Libraries: Interdisciplinary Concepts, Challenges and Opportunities, CoLIS3 Proceedings, Dubrovnik, Croatia, 23-26 May 1999.1999.[website]
[ghemawat:googlefs]
Sanjay Ghemawat and Howard Gobioff and Shun-Tak Leung.The Google File System.Proceedings of the Nineteenth ACM Symposium on Operating Systems Principles.2003. [website]
[cooper-stanford]
Brian F. Cooper and Arturo Crespo and Hector Garcia-Molina.The Stanford Archival Repository Project: Preserving our digital past. [website]
[downey01structural]
Allen B. Downey.The structural cause of file size distributions.SIGMETRICS/Performance.2001. [website]
[larson95sequoia]
Ray R. Larson and Christian Plaunt and Allison G. Woodruff and Marti Hearst.The {Sequoia} 2000 Electronic Repository.Digital Technical Journal of Digital Equipment Corporation.1995. [website]
[DBLP:conf/vissym/KosaraBH04]
Robert Kosara and Fabian Bendix and Helwig Hauser.TimeHistograms for Large, Time-Dependent Data..VisSym.2004.[website]
[moh04timeline]
C. Moh and B. Liskov.Timeline: A high performance archive for a distributed object store.First Symposium on Networked Systems Design and Implementation (NSDI).2004. [website]
[oai:CiteSeerPSU:554142]
Chunqiang Tang and Mallik Mahalingam and Zhichen Xu.Towards a Semantic, Deep Archival File System.2002. [website]
[mahalingam-towards]
Mallik Mahalingam and Chunqiang Tang and Zhichen Xu.Towards a Semantic, Deep Archival File System.The 9th International Workshop on Future Trends of Distributed Computing Systems (FTDCS 2003).2003. [website]
[oai:CiteSeerPSU:176632]
Michael Wan and Reagan Moore and Richard Frost and Richard Marciano and Tom Sherwin.Towards the Interoperability of Web, Database, and Mass Storage Technologies for Petabyte Archives.1996. [website]
[and-transactional]
Barbara Liskov And Rodrigo Rodrigues.Transactional File Systems Can Be Fast. [website]
[aranya04msthesis]
A. Aranya Versatile File System Tracing with Tracefs Stony Brook University 2004 August Technical Report FSL-04-05 www.fsl.cs.sunysb.edu/docs/tracefs-msthesis/tracefs.pdf, Aranya 2005-06-09 2005-06-09 . [website]
[DBLP:conf/wsc/MullerS03]
Wolfgang M{\"u}ller and Heidrun Schumann.Visualization for modeling and simulation: visualization methods for time-dependent data - an overview..Proceedings of the 35th Winter Simulation Conference: Driving Innovation, New Orleans, Louisiana, USA, December 7-10, 2003.2003.
[oai:DLIST.OAI2:393]
Anita Coleman and Chris Neuhaus.Web Metrics Bibliography.2004. [website]
[Barroso:2003:WSP]
Luiz Andr{\'e} Barroso and Jeffrey Dean and Urs H{\"o}lzle.Web Search for a Planet: The {Google Cluster Architecture}.IEEE Micro.2003. [website]
[Burrell:GIGP]
Quentin L. Burrell and Michael R. Fenton.Yes, the GIGP Really Does Work and Is Workable!.Journal of the American Society for Information Science.1993. [website]
[conference04usenix]
Zhihui Zhang and Kanad Ghose.yFS: A Journaling File System Design for Handling Large Data Sets with Reduced Seeking.Proceedings of Fast '03: 2nd USENIX Conference on File and Storage Technologies.2003.[website]
[oai:CiteSeerPSU:614973]
Brian Lent and George H. John.{SIP}ping from the Data Firehose.1997. [website]
[fast04tracefs]
A. Aranya and C. P. Wright and E. Zadok.{Tracefs: A File System to Trace Them All}.Proceedings of the Third USENIX Conference on File and Storage Technologies (FAST 2004).2004. [website]
[irlam93:ufs]
G. Irlam.{UNIX} File Size Survey ({UFS93}).World Wide Web document. URL: {\emph{http://www.base.com/gordoni/ufs93.html}}.1993. [website]
[Quinlan:2002:VNA]
Sean Quinlan and Sean Dorward.{Venti}: {A} New Approach to Archival Data Storage.Proceedings of the {FAST} '02 Conference on File and Storage Technologies: January 28-30, 2002, Monterey, California, {USA}.2002. [website]
[2005cs........2028V]
{Van de Sompel}, H. and {Bekaert}, J. and {Liu}, X. and {Balakireva}, L. and {Schwander}, T..{aDORe: a modular, standards-based Digital Object Repository}.ArXiv Computer Science e-prints.2005. [website]