代码改变世界

NoSQL 数据库汇总

2011-12-08 16:17  Haippy  阅读(1422)  评论(0编辑  收藏  举报
Core NoSQL Systems: [Mostly originalted out of a Web 2.0 need]

Wide Column Store / Column Families
Hadoop / HBase: API: Java / any writer, Protocol: any write call, Query Method: MapReduce Java / any exec, Replication: HDFS Replication, Written in: Java, Concurrency: ?, Misc: Links: 3 Books [1, 2, 3]

Cassandra: API: many Thrift » languages, Protocol: ?, Query Method: MapReduce, Replicaton: , Written in: Java, Concurrency: eventually consistent , Misc: like "Big-Table on Amazon Dynamo alike",  initiated by Facebook, Slides » , Clients », Installation »

Hypertable:  API: Thrift (Java, PHP, Perl, Python, Ruby, etc.), Protocol: Thrift, Query Method: HQL, native Thrift API, Replication: HDFS Replication, Concurrency: MVCC, Consistency Model:  Fully consistent Misc:  High performance C++ implementation of Google's Bigtable.  Commercial support  »

Amazon SimpleDB: Misc: not open source / part of AWS, Book »

Cloudata: Google's Big table clone like HBase. Article »
Cloudera: Professional Software & Services based on Hadoop.
SciDB: Array Data Model for Scientists, paper », poster », HiScaBlog »
HPCC: from LexisNexis, info, article
Stratosphere:  (research system) massive parallel & flexible execution, M/R generalization and extention (paper, poster).
[OpenNeptune, Qbase, KDI]:


Document Store
MongoDB:  API: BSON, Protocol: lots of langs, Query Method: dynamic object-based language & MapReduce, Replication: Master Slave & Auto-Sharding, Written in: C++,Concurrency: Update in Place. Misc: Indexing, GridFS Links: Talk », Notes »

CouchDB:  API: JSON, Protocol: REST, Query Method: MapReduceR of JavaScript Funcs, Replication: Master Master, Written in: Erlang, Concurrency: MVCC,  Misc
Links: 3 CouchDB books », Couch Lounge » (partitioning / clusering),  Dr. Dobbs »

OrientDB: Languages: Java, Schema: Has features of an Object-Database, DocumentDB, GraphDB or Key-Value DB, Written in: Java, Query Method: Native and SQL, Misc: really fast, lightweight, ACID with recovery.

RavenDB
: .Net solution. Provides HTTP/JSON access. LINQ queries & Sharding supported. Misc: »

Citrusleaf:

Clusterpoint Server: (freeware+commercial) API: XML, PHP, Java, .NET Protocols: HTTP, REST, native TCP/IP Query Method: full text search, XML, range and Xpath queries; Written in C++ Concurrency: ACID-compliant, transactional, multi-master cluster Misc: Petabyte-scalable document store and full text search engine.  Information ranking. Replication. Cloudable.

ThruDB: (please help provide more facts!) Uses Apache Thrift to integrate multiple backend databases as BerkeleyDB, Disk, MySQL, S3.

Terrastore: API: Java & http, Protocol: http, Language: Java, Querying: Range queries, Predicates, Replication: Partitioned with consistent hashing, Consistency: Per-record strict consistency, Misc: Based on Terracotta

SisoDB:  A Document Store on top of SQL-Server.
[RaptorDB, CloudKit, Perservere, Jackrabbit »]


Key Value / Tuple Store
Azure Table Storage:  Collections of free form entities (row key, partition key, timestamp). Blob and Queue Storage available, 3 times redundant. Accessible via REST or ATOM.

MEMBASE: API: Memcached API+protocol (binary and ASCII) , most languages, Protocol: Memcached REST interface for cluster conf + management, Written in: C/C++ + Erlang (clustering), Replication: Peer to Peer, fully consistent, Misc: Transparent topology changes during operation, provides memcached-compatible caching buckets, commercially supported version available, Links: Wiki », Article » 

Riak: API: JSON, Protocol: REST, Query Method: MapReduce term matching , Scaling: Multiple Masters; Written in: Erlang, Concurrency: eventually consistent (stronger then MVCC via Vector Clocks), Misc: ... Links: talk »,

Redis :  API: Tons of languages, Written in: C, Concurrency: in memory and saves asynchronous disk after a defined time. Append only mode available. Different kinds of fsync policies. Replication: Master / Slave, Misc: also lists, sets, sorted sets, hashes, queues. Cheat-Sheet: », great  slides »  Admin UI » From the Ground up »

LevelDB: Fast & Batch updates. DB from Google. Blog », hot Benchmark », Article » (in German).

Chordless: API: Java & simple RPC to vals, Protocol: internal, Query Method: M/R inside value objects, Scaling: every node is master for its slice of namespace, Written in: Java, Concurrency: serializable transaction isolation, Links:

GenieDB: Immediate consistency sharded KV store with an eventually consistent AP store bringing eventual  consistency issues down to the theoretical minimum. It features efficient record coalescing. GenieDB speaks SQL and co-exists /  do intertable joins with SQL RDBMs.

Scalaris: (please help provide more facts!) Written in: Erlang, Replication: Strong consistency over replicas, Concurrency: non blocking Paxos.

Tokyo Cabinet / Tyrant: Links: nice talk », slides », Misc: KyotoCabinet »

GT.M: API: M, C, Python, Perl, Protocol: native, inprocess C, Misc: Wrappers: M/DB for SimpleDB compatible HTTP », MDB:X for XML », PIP for mapping to tables for SQL », Features: Small footprint (17MB), Terabyte Scalability, Unicode support, Database encryption, Secure, ACID transactions (single node), eventual consistency (replication), License: AGPL v3 on x86 GNU/Linux, Links: Slides »,

Scalien:  API / Protocol: http (text, html, JSON), C, C++, Python, Java, Ruby, PHP,Perl. Concurrency: Paxos.

Berkeley DB: API: Many languages, Written in: C, Replication: Master / Slave, Concurrency: MVCC, License: Sleepycat, Berkeley DB Java Edition: API: Java, Written in: Java, Replication: Master / Slave, Concurrency: serializable transaction isolation, License: Sleepycat

MemcacheDB: API: Memcache protocol (get, set, add, replace, etc.), Written in: C, Data Model: Blob, Misc: Is Memcached writing to BerkleyDB.

Faircom C-Tree: API: C, C++, C#, Java, PHP, Perl, Written in: C,C++. Misc: Transaction logging. Client/server. Embedded. SQL wrapper (not core). Been around since 1979.

HamsterDB:  (embedded solution) ACID Compliance, Lock Free Architecture (transactions fail on conflict rather than block), Transaction logging & fail recovery (redo logs), In Memory support – can be used as a non-persisted cache, B+ Trees – supported [Source: Tony Bain »]

STSdb: API: C#, Written in C#, embedded solution, generic XTable<TKey,TRecord> implementation,
ACID transactions, snapshots, table versions, shared records, vertical data compression, custom compression, composite & custom primary keys, available backend file system layer, works over multiple volumes, petabyte scalability, LINQ.

Tarantool/Box:  API: C, Perl, PHP, Python, Java and Ruby. Written in: Objective C ,Protocol: asynchronous binary, memcached, text (Lua console). Data model: collections of dimensionless tuples, indexed using primary + secondary keys. Concurrency: lock-free in memory, consistent with disk (write ahead log). Replication: master/slave, configurable. Other: call Lua stored procedures.

Maxtable API: C, Query Method: MQL, native API, Replication: DFS Replication, Consistency: strict consistency  Written in: C.

Pincaster: For geolocalized apps. Concurrency: in-memory with asynchronous disk writes. API: HTTP/JSON. Written in: C. License: BSD.

RaptorDB:  A pure key value store with optimized b+tree and murmur hashing. (In the near future it will be a JSON document database much like mongodb and couchdb.)

Mnesia: (ErlangDB »)
LightCloud:  (based on Tokyo Tyrant)
Hibari: Hibari is a highly available, strongly consistent, durable, distributed key-value data store

TIBCO Active Spaces: peer-to-peer distributed in-memory (with persistence) datagrid that implements and expands on the concept of the Tuple Space. Has SQL Queries and ACID (=> NewSQL).

allegro-C: Key-Value concept. Variable number of keys per record. Multiple key values, Hierarchic records. Relationships. Diff. record types in same DB. Indexing: B*-Tree. All aspects configurable. Full scripting language. Multi-user ACID. Web interfaces (PHP, Perl, ActionScript) plus Windows client.

[Scality », KaTree »TomP2P », Kumofs » , TreapDB », NMDB, luxio, actord, keyspace, flare, schema-free, RAMCloud]


Eventually Consistent Key Value Store
Amazon Dynamo: Misc: not open source (see KAI below)

Voldemort: Open-Source implementation of Amazons Dynamo Key-Value Store.

Dynomite: Open-Source implementation of Amazons Dynamo Key-Value Store. written in Erlang. With "data partitioning, versioning, and read repair, and user-provided storage engines provide persistence and query processing".

KAI: Open Source Amazon Dnamo implementation, Misc: slides ,

[SubRecord, Mo8onDb, Dovetaildb]


Graph Databases »
Neo4J:  API: lots of langs, Protocol: Java embedded / REST, Query Method: SparQL, nativeJavaAPI, JRuby, Replication: typical MySQL style master/slave, Written in: Java, Concurrency: non-block reads, writes locks involved nodes/relationships until commit, Misc: ACID possible, Links: Video », good Blog » 

Infinite Graph: (by Objectivity) API: Java, Protocol: Direct Language Binding, Query Method: Graph Navigation API, Predicate Language Qualification, Written in: Java (Core C++), Data Model: Labeled Directed Multi Graph, Concurrency: Update locking on subgraphs, concurrent non-blocking ingest, Misc: Free for Qualified Startups.

Sones: OO Graph DB, API: .NET, Protocol: .NET embedded, REST, WebServices, Query Method: Graph Query Language, LINQ (M/R), Replication: Master-Master, Written in: C#, Concurrency: MVCC

InfoGrid:  API: Java, http/REST, Protocol: as API + XPRISO, OpenID, RSS, Atom, JSON, Java embedded, Query Method: Web user interface with html, RSS, Atom, JSON output, Java native, Replication: peer-to-peer, Written in: Java, Concurrency: concurrent reads, write lock within one MeshBase, Misc: Presentation »

HyperGraphDB:  API: Java (and Java Langs), Written in:Java,  Query Method: Java or P2P, Replication: P2P, Concurrency: STM, Misc: Open-Source, Especially for AI and Semantic Web.

Trinity: API: C#, Protocol: C# Language Binding, Query Method: Graph Navigation API, Replication: P2P with Master Node, Written in: C#, Concurrency: Yes (Transactional update in online query mode, Non-blocking read in Batch Mode) Misc: distributed in-memory storage, parallel graph computation platform

AllegroGraph: API: Java, Python, Ruby, C#, Perl, Clojure, Lisp Protocol: REST, Query Method: SPARQL and Prolog, Libraries: Social Networking Analytics & GeoSpatial, Written in: Common Lisp , Links: Learning Center », Videos »

Bigdata:  API: Java, Jini service discovery, Concurrency: very high (MVCC), Written in: Java, Misc: GPL + commercial, Data: RDF data with inference, dynamic key-range sharding of indices, Misc: Blog » (parallel database, high-availability architecture, immortal database with historical views)

DEX: API: Java,  Protocol: Java Embedded, Query Method: Java API, Written in: Java / C++, Data Model: Labeled Directed Attributed Multigraph, Concurrency: yes, Misc: Free community edition up to 1 Mio nodes, Links: Intro », Tutorial »

OpenLink Virtuoso: Hybrid DBMS covering the following models: Relational, Document, Graph.
VertexDB:
FlockDB: by twitter» »
AllegroGraph:
[Java Universal Network / Graph Framework, Sesame, Filament, OWLim, NetworkX, iGraph]

Soft NoSQL Systems:
[Mostly NOT originalted out of a Web 2.0 need but worth a look for great non relational solutions]


Object Databases     (more info at ODBMS.org)
db4o: API: Java, C#, .Net Langs, Protocol: language, Query Method: QBE (by Example), Soda, Native Queries, LINQ (.NET),  Replication: db4o2db4o & dRS to relationals, Written in: Java,  Cuncurrency: ACID serialized, Misc: embedded lib, Links: DZone Refcard #53 », Book »

Versant: Languages/Protocol: Java, C#, C++, Python. Schema: language class model (easy changable). Modes: always consistent and eventually consistent  Replication: synchronous fault tolerant and peer to peer asynchronous. Concurrency:  optimistic and object based locks. Scaling: can add physical nodes on fly for scale out/in and migrate objects between nodes without impact to application code. Misc: MapReduce via parallel SQL like query across logical database groupings.

Objectivity: Languages: Java, C#, C++, Python, Smalltalk, SQL access through ODBC. Schema: native language class model, direct support for references, interoperable across all language bindings. 64 bit unique object ID (OID) supports multi exa-byte.  Platforms: 32 and 64 bit Windows, Linux, Mac OSX, *Unix. Modes: always consistent (ACID).  Concurrency: locks at cluster of objects (container) level. Scaling: unique distributed architecture, dynamic addition/removal of clients & servers, cloud environment ready. Replication: synchronous with quorum fault tolerant across peer to peer partitions.

Starcounter: API: C# (.NET languages), Schema: Native language class model, Query method: SQL, Concurrency: Fully ACID compliant, Storage: In-memory with transactions secured on disk, Reliability: Full checkpoint recovery, Misc: VMDBMS - Integrating the DBMS with the virtual machine for maximal performance and ease of use.


Perst: API: Java,Java ME,C#,Mono. Query method: OO via Perst collections, QBE, Native Queries, LINQ, native full-text search, JSQL Replication: Async+sync (master-slave) Written in: Java, C#.
Caching: Object cache (LRU, weak, strong), page pool, in-memory database Concurrency: Pessimistic+optimistic (MVCC) + async or sync (ACID) Index types: Many tree models + Time Series. Misc.:Embedded lib., encryption, automatic recovery, native full text search, on-line or off-line backup.


Magma: Smalltalk DB, optimistic locking, Transactions, etc.

NEO:  API: Python - ZODB "Storage" interface,  Protocol: native,  Query Method: transactional key-value,  Replication: native,  Written in: Python,  Concurrency: MVCC (internally),  License: GPL "v2 or later",  Misc: Load balancing, fault tolerant, hot-extensible.

PicoLisp: Language and Object Database, can be viewed as a Database Development Framework. Schema: native language class model with relations + various indexes. Queries: language build in + a small Prolog like DSL Pilog. Concurrency: synchronization + locks. Replication, distribution and fault tolerance is not implemented per default but can be implemented with native functionality. Written in C (32bit) or assembly (64bit).

Sterling: is a lightweight object-oriented database for .NET with support for Silverlight and Windows Phone 7. It features in-memory keys and indexes, triggers, and support for compressing and encrypting the underlying data.

Morantex: Stores .NET classes in a datapool. Build for speed. SQL Server integration. LINQ support.
[Gemstone, Progress]

EyeDB
EyeDB is an LGPL OODBMS, provides an advanced object model (inheritance, collections, arrays, methods, triggers, constraints, reflexivity), an object definition language based on ODMG ODL, an object query and manipulation language based on ODMG OQL. Programming interfaces for C++ and Java.

[StupidDB », KiokuDB » (Perl solution), Durus » ]



Grid & Cloud Database Solutions
GigaSpaces: Popular SpaceBased Grid Solution.

Queplix: NOSQL Data Integration Environment, can integrate relational, object, BigData – NOSQL easily and without any SQL.

Hazelcast: P2P Data Grid Solution on java.util.*, On a 100 Noce EC2 Cluster »

Joafip:

[GridGain, Infinispan, Coherence, eXtremeScale
]



XML Databases
Mark Logic Server:  (commercial system) API: Java, .NET ProtocolsHTTP, REST Query Method: XQuery, XSLT Written in: C++ (code closed, API open) Concurrency: Share-nothing cluster architecture Misc: Petabyte-scalable XML server. Cloudable. Combines application server, transactional persistent storage (XML, text, and binary), and a full-text search engine. Developer Network »

EMC Documentum xDB:   (commercial system) API: Java, XQuery, Protocol: WebDAV, web services, Query method: XQuery, XPath, XPointer, Replication: lazy primary copy replication (master/replicas), Written in: Java, Concurrency: concurrent reads, writes with lock; transaction isolation, Misc: Fully transactional persistent DOM; versioning; multiple index types; metadata and non-XML data support; unlimited horizontal scaling. Developer Network »

eXist: API: XQuery, XML:DB API, DOM, SAX, Protocols: HTTP/REST, WebDAV, SOAP, XML-RPC, Atom, Query Method: XQuery, Written in: Java (open source), Concurrency: Concurrent reads, lock on write; Misc: Entire web applications can be written in XQuery, using XSLT, XHTML, CSS, and Javascript (for AJAX functionality). (1.4) adds a new full text search index based on Apache Lucene, a lightweight URL rewriting and MVC framework, and support for XProc.

Sedna: Misc: ACID transactions, security, indices, hot backup. Flexible XML processing facilities include W3C XQuery implementation, tight integration of XQuery with full-text search facilities and a node-level update language.

BaseX:  BaseX is a fast, powerful, lightweight XML database system and XPath/XQuery processor with highly conformant support for the latest W3C Update and Full Text Recommendations. Client/Server architecture, ACID transaction support, user management, logging, Open Source, BSD-license, written in Java, runs out of the box.

Qizx:  commercial and open source version, API: Java, Protocols: HTTP, REST, Query Method: XQuery, XQuery Full-Text, XQuery Update, Written in: Java, full source can be purchased, Concurrency: Concurrent reads & writes, isolation, Misc: Terabyte scalable, emphasizes query speed.

Berkeley DB XML: API: Many languages, Written in: C++, Query Method: XQuery, Replication: Master / Slave, Concurrency: MVCC, License: Sleepycat

[Xindice Tamino ]


Multivalue Databases
U2 (UniVerse, UniData): MultiValue Databases, Data Structure: MultiValued, Supports nested entities, Virtual Metadata, API: BASIC, InterCall, Socket, .NET and Java API's, IDE: Native, Record Oriented, Scalability: automatic table space allocation, Protocol: Client Server, SOA,  Terminal Line, X-OFF/X-ON, Written in: C, Query Method: Native mvQuery, (Retrieve/UniQuery) and SQL, Replication: yes, Hot standby, Concurrency: Record and File Locking (Fine and Coarse Granularity)

OpenInsight: API:  Basic+, .Net, COM, Socket, ODBC, Protocol: TCP/IP, Named Pipes, Telnet, VT100. HTTP/S Query Method: RList, SQL & XPath
Written in: Native 4GL, C, C++, Basic+, .Net, Java  Replication: Hot Standby Concurrency: table &/or row locking, optionally transaction based & commit & rollback Data structure: Relational &/or MultiValue, supports nested entities Scalability: rows and tables size dynamically

Reality
(
Northgate IS): The original MultiValue data set database, virtual machine, enquiry and rapid development environment. Delivers ultra efficiency, scalability and resilience while extended for the web and with built-in auto sizing, failsafe and more. Interoperability includes Web Services, Java Classes, XML, ActiveX, Sockets, C and, for those that have to interoperate with the SQL world, ODBC/JDBC and two-way transparent SQL data access.

OpenQM:
Supports nested data. Fully automated table space allocation. Concurrency control via task locks, file locks & shareable/exclusive record locks. Case insensitivity option. Secondary key indices. Integrated data replication. QMBasic programming language for rapid development. OO programming integrated into QMBasic. QMClient connectivity from Visual Basic, PowerBasic, Delphi, PureBasic, ASP, PHP, C and more. Extended multivalue query language.

Globals: by Intersystems, multidimensional

ESENT: (by Microsoft)  ISAM storage technology. Access using index or cursor navigation. Denormalized schemas, wide tables with sparse columns, multi-valued columns, and sparse and rich indexes. C# and Delphi drivers available. Backend for a number of MS Products as Exchange.


other NoSQL related databases
IBM Lotus/Domino: Type: Document Store, API: Java, HTTP, IIOP, C API, REST Web Services, DXL, Languages: Java, JavaScript, LotusScript, C, @Formulas, Protocol: HTTP, NRPC, Replication: Master/Master, Written in: C
Concurrency: Eventually Consistent, Scaling: Replication Clusters

Intersystems Cache:
Postrelational System. Caché has JDBC, ODBC and sql interfaces, several programming languages and many more. The system is availible for Windows, Linux and OpenVMS.

eXtremeDB:
Type: In-Memory Database; Written in: C; API: C/C++, SQL, JNI, C#(.NET), JDBC; Replication: Async+sync (master-slave), Cluster;
Scalability: 64-bit and MVCC

RDM Embedded: APIs: C++, Navigational C. Embedded Solution that is ACID Compliant with Multi-Core, On-Disk & In-Memory Support. Distributed Capabilities, Hot Online Backup, supports all Main Platforms. Supported B Tree & Hash Indexing. Replication: Master/Slave, Concurrency: MVCC. Client/Server: In-process/Built-in.

ISIS Family: (semistructured databases) »
Prevayler:Java RAM Data structure journalling.
Yserial
: Python wrapper over sqlite3


unresolved and uncategorized
VMware vFabric GemFire: »  (coming soon!)

Btrieve
: (by Pervasive Software) key/index/tupel DB. Using Pages.  » (faq »)

Tokutek:

Recutils: GNU Tool for text files containing records and fields. Manual »

FileDB: Mainly targeted to Silverlight/Windows Phone developers but its also great for any .NET application where a simple local database is required, extremely Lightweight - less than 50K, stores one table per file, including index, compiled versions for Windows Phone 7, Silverlight and .NET, fast, free to use in your applications

illuminate Correlation Database », FluidDB (Column Oriented DB) »,Fleet DB », Btrieve ,
Twisted Storage », Ringo
», Sherpa », tin », Dryad»SkyNet », Disco »

Possibly the oldest NoSQL DB: Adabas »

参考:nosql-databases.org