Data Management

“NSF envisions a world in which digital science and engineering data are routinely deposited in convenient repositories, can be readily discovered in well-documented form by specialists and non-specialists alike, are open and accessible, and are reliably preserved.”

NSF Cyberinfrastructure Vision for 21st Century Discovery

High-Speed Scalable Storage

Data sharing and data management plans are now required by many funding agencies, including the NSF and NIH. To implement these plans, many researchers prefer a high-speed, scalable storage architecture.

CAC operates a large-capacity DataDirect Networks (DDN) storage system that provides 6 GB/sec aggregate reads and writes, RAID 6 with 8 data + 2 parity drives to protect against drive failures, and continuous integrity checking and error correction to guard against silent data corruption.
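To make the 8 data + 2 parity layout concrete, the short Python sketch below works out what such a RAID 6 group implies for usable capacity and fault tolerance. The function name `raid6_summary` and the 4 TB drive size are illustrative assumptions, not details of the CAC system.

```python
def raid6_summary(data_drives: int, parity_drives: int, drive_tb: float) -> dict:
    """Capacity and fault-tolerance arithmetic for one RAID group."""
    total_drives = data_drives + parity_drives
    return {
        "raw_tb": total_drives * drive_tb,            # capacity of all drives
        "usable_tb": data_drives * drive_tb,          # capacity left for data
        "usable_fraction": data_drives / total_drives,
        "tolerated_failures": parity_drives,          # RAID 6 survives two failed drives
    }


# Hypothetical 4 TB drives in an 8 data + 2 parity group.
summary = raid6_summary(data_drives=8, parity_drives=2, drive_tb=4.0)
print(summary)
```

Under these assumptions, 80% of the raw capacity holds data, and the group can lose any two drives without losing data.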

This system is available for standard file storage. Special options include disk slot leases and low-cost archival storage with fast parallel file transfers via Globus Online.

For details on these data management technologies, visit storage services.

Database Management Systems

Researchers and educators are increasingly leveraging new technologies such as NoSQL databases that do not require fixed table schemas and scale horizontally.
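As a rough illustration of what "no fixed table schema" means in practice, the Python sketch below contrasts a relational table with a document-style collection. It is only a sketch: it uses Python's built-in `sqlite3` and `json` modules to mimic document storage rather than a real NoSQL database, and the table names and sample records are hypothetical.

```python
import json
import sqlite3

conn = sqlite3.connect(":memory:")

# Fixed schema: every row must fit the columns declared up front.
conn.execute("CREATE TABLE samples (id INTEGER PRIMARY KEY, species TEXT, mass_g REAL)")
conn.execute("INSERT INTO samples (species, mass_g) VALUES (?, ?)", ("finch", 18.5))

try:
    # A record with an undeclared field is rejected until the schema is altered.
    conn.execute("INSERT INTO samples (species, wingspan_cm) VALUES (?, ?)", ("heron", 180))
    rejected = False
except sqlite3.OperationalError:
    rejected = True

# Document style: each record is self-describing, so fields can vary per record.
conn.execute("CREATE TABLE documents (id INTEGER PRIMARY KEY, body TEXT)")
docs = [
    {"species": "finch", "mass_g": 18.5},
    {"species": "heron", "wingspan_cm": 180, "tags": ["wetland"]},
]
conn.executemany("INSERT INTO documents (body) VALUES (?)", [(json.dumps(d),) for d in docs])

# Read the documents back and collect every field name that appears.
loaded = [json.loads(row[0]) for row in conn.execute("SELECT body FROM documents ORDER BY id")]
fields = sorted({key for doc in loaded for key in doc})
print(rejected, fields)
```

The trade-off is that the document style moves schema knowledge out of the database and into the application, which must cope with fields that are present in some records and absent in others.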

Our consultants are available to help you implement these technologies or use general-purpose database management systems such as PostgreSQL, SQLite, MySQL, or Microsoft SQL Server. End-user queries are optimized for the best user experience. A portion of a CAC server may be leased to reduce data management system deployment and administrative costs.

CAC works with industry to test and leverage the latest data management and storage technologies. For example, CAC is piloting Ceph with Dell and Red Hat to explore the potential of a unified data storage solution that supports object storage with layered file systems, including Lustre. CAC seeks a storage solution with the flexibility to support all of its research needs and a scalable upgrade path at an optimum price point.

The Department of Computer Science is engaged in basic database systems research.