Posts Tagged ‘hadoop’

04.16
12

Out Hadoop Connector for MongoDB

by admin ·

Submitted release Mongo + Hadoop Connector – set of modules for the platform Hadoop, providing a connection to NoSQL database MongoDB. Now MongoDB can act as a source of data for Hadoop (Map / Reduce, and other ecosystem components Hadoop). It is also possible, and the inverse operation – record the results in MongoDB.

Currently it supports basic API Hadoop (known as Hadoop Common ). Provided support for recording in MongoDB of Pig and Streaming for Map / Reduce in Python.
(more…)

02.9
12

Release of Apache Mahout 0.6

by admin ·

February 6th saw the release of Apache Mahout numbered 0.6.

Among the innovations:

  • Reducing the computation time of the scalar product of vectors.
  • Support for MongoDB and Cassandra DataModel.
  • Fixed some bugs and introduced some additional improvements.

A detailed list of changes is available on the website Apache Software Foundation .
(more…)

01.11
12

Apache Hadoop 1.0

by admin ·

Apache Software Foundation anonisiroval release Hadoop 1.0 – a popular platform for processing large volumes of data.

Apache Hadoop is widely used in Amazon, AOL, Apple, eBay, Facebook, foursquare, HP, LinkedIn, Netflix, The New York Times, Rackspace, Twitter, Yahoo and other companies.

Version 1.0 includes:

  • DBMS support Hbase – added functions to sync and flush, are required to support the transaction log
  • Kerberos-based authentication
  • (more…)