capture &| stream data between browsers w/o intermediary & plugins|3rd party
consequence of a messaging based approach: no longer a need for a single conceptual model to underpin the integration effort
Distributed commit log. good for high volume data processing pipelines and realtime/batch consumers.
backward compatibility, forward compatibility, and full compatibility
a MapReduce implementation that dramatically eases continuous upload of data into Hadoop clusters
High throughput;Low Latency No buffer management If serialized: no locking/batching
2 storage engines lockless in-memory & disk w/ 2level Btree transactions+replication w/ WAL for CD
ultra-fast transaction throughput at low latencies commonly used with a DW (or Hadoop) to optimize OLTP throughput & analytic queries/repor
Embedded persistent KVS Not distributed No failover Not highly-available, if machine dies you lose your data
messaging server persisting to RethinkDB
facilitates querying and managing large datasets residing in distributed storage
compute streams off other data streams in real-time as events occurred. useful in areas w/ lots of complex transformations
computational engine
Big data
abstract
capture &| stream data between browsers w/o intermediary & plugins|3rd party
articles
MOM
consequence of a messaging based approach: no longer a need for a single conceptual model to underpin the integration effort
Kafka
Distributed commit log. good for high volume data processing pipelines and realtime/batch consumers.
Confluent
backward compatibility, forward compatibility, and full compatibility
a MapReduce implementation that dramatically eases continuous upload of data into Hadoop clusters
IMDB
High throughput;Low Latency No buffer management If serialized: no locking/batching
2 storage engines lockless in-memory & disk w/ 2level Btree transactions+replication w/ WAL for CD
Datascript
VoltDB
ultra-fast transaction throughput at low latencies commonly used with a DW (or Hadoop) to optimize OLTP throughput & analytic queries/repor
MemSQL
RocksDB
Embedded persistent KVS Not distributed No failover Not highly-available, if machine dies you lose your data
deepstream
messaging server persisting to RethinkDB
Magnet
Avro
Hive
facilitates querying and managing large datasets residing in distributed storage
SmartStack
Consul
stream processing systems
compute streams off other data streams in real-time as events occurred. useful in areas w/ lots of complex transformations
HDFS
Hadoop
computational engine