The research paths I have taken over the last three decades have been highly collaborative and interdisciplinary and have spanned four main areas:
1. developing transparency and trustworthiness in algorithms and software (“digital trust”)
- I contributed to the development of algebraic methodology and software technologies and to computational geography and the management of spatial records at scale
2. developing records management software for handing large amounts of information
- while at the San Diego Supercomputer Center (SDSC), with my colleague Reagan Moore, we coined the concept of data grids and launched a data-intensive computing group, which led to the development of the Storage Resource Broker (SRB)
3. pioneering policy-enabled infrastructure
- the SRB led to the development of the open-source integrated Rule-Oriented Data System (iRODS), were data grids and business rules were combined to tackle long-term digital preservation challenges
4. pioneering Big Data / Big Records Curation
- In particular over the last 3 years at the Maryland iSchool,
i. We launched a Digital Curation Innovation Center (DCIC),
ii. We developed a next generation of scalable repositories called DRAS-TIC (Digital Repository At Scale That Invites Computation), now owned by UMD, and based on so-called NoSQL database technologies, the kind used by some 1,800 companies including eBay, GitHUB, Hulu, Instagram, Netflix, and Twitter.
iii. We are establishing a new discipline called Computational Archival Science (CAS)
iv. We are now working with 70 students on 8 hands-on team-based CAS projects with clients and deliverables.