Chok is a system for distributed data storage that provides real time access, scalability and failure tolerance. Chok serves large, replicated, data sets as shards to serve high loads and large amounts of data. These data set can be of different type. Currently reference implementations are available for Lucene and Hadoop mapfiles. Chok is a fork of Katta.