Is there a good library for accessing HBase from Python? [closed]
I spent some time looking around, and all I could find is 开发者_如何学运维Jython. It's an option, but is there something that could be used in a more pythonesque (simpler) way?
Stargate is still in the contrib part of the Hbase project while ThriftServer is maintained in core(org.apache.hadoop.hbase.thrift). Grab the HBase.thrift file from the repository and run
thrift --gen py HBase.thrift
on it, shove the contents into wherever, and startup a thrift server.
Stargate is very very slow. The HBase thrift still has some work to be done on it, however it is still being actively worked on
A couple of places to get started
http://wiki.apache.org/hadoop/Hbase/ThriftApi
You can try thrift python bindings but the project seems dead. I'd go with starting HBase REST server, and then using python standard libraries to access that RESTful web service.
FWIW, I'm trying to get something started at http://github.com/hammer/pyhbase. Totally a hack right now but will be polishing it over the next few weeks. I link to the Mozilla client that I started from.
Also go through https://github.com/tousif/Hwrapper wrapper for the Hbase REST api.
精彩评论