“NSF envisions a world in which digital science and engineering data are routinely deposited in convenient repositories, can be readily discovered in
well-documented form by specialists and non-specialists alike, are open and accessible, and are reliably preserved.”
NSF Cyberinfrastructure Vision for 21st Century Discovery
High-Speed Scalable Storage
The development of data sharing and data management plans are now required by many funding agencies including the NSF and NIH. To implement these plans, many researchers prefer a high-speed scalable storage architecture.
CAC operates a large capacity DataDirect Networks (DDN) storage system that provides 6GB/sec aggregate reads and writes, 8 data+2 parity drives RAID6 for protecting against drive failures, and continuous integrity check and error correction against silent data corruptions.
This system is available for standard file storage and special options are available such as disk slot leases and low-cost archival storage with fast parallel file transfers via Globus Online.
For details on these data management technologies, visit storage services.
Database Management Systems
Researchers and educators are increasingly leveraging new technologies such as NoSQL databases that do not require fixed table schemas and scale horizontally.
Our consultants are available to help you implement these technologies or to use general purpose database management systems such as PostgreSQL, SQLite, MySQL, or Microsoft SQL Server. End-user queries are optimized for the best user experience. A portion of a CAC server may be leased to reduce data management system deployment and administrative costs.
While many database projects have modest scaling requirements, others must scale to tens or even hundreds of terabytes. Our database consulting staff have expertise in statistical modeling and the development of scientific workflows and automation for physics data, pulsar data, DNA sequencing, etc.
CAC works with industry to test and leverage the latest data management technologies. The Department of Computer Science is engaged in basic database systems research.