Home |  MySQL Buzz |  FAQ |  Feeds |  Submit your blog feed |  Feedback |  Archive |  Aggregate feed RSS 2.0 English Deutsch Español Français Italiano 日本語 Русский Português 中文
Showing entries 1 to 30 of 45 Next 15 Older Entries

Displaying posts with tag: cassandra (reset)

Real-Time Replication from MySQL to Cassandra
+1 Vote Up -0Vote Down

Earlier this month I blogged about our new Hadoop applier, I published the docs for that this week (http://docs.continuent.com/tungsten-replicator-3.0/deployment-hadoop.html) as part of the Tungsten Replicator 3.0 documentation (http://docs.continuent.com/tungsten-replicator-3.0/index.html). It contains some additional interesting nuggets that will appear in future blog posts.

The main part of that functionality that performs the actual applier for Hadoop is based around a JavaScript applier engine – there will eventually be docs for that as part of the Batch Applier content (

  [Read more...]
New MySQL features, related technologies at Percona Live London
+0 Vote Up -0Vote Down

The upcoming Percona Live London conference, November 11-12, features quite a number of talks about the latest MySQL features and related technologies. There will be a lots of talks about the new MySQL 5.6 features:

  [Read more...]
Sending the query to the data
+0 Vote Up -0Vote Down

It’s common wisdom that large-scale database systems require distributing the data across machines. But what seems to be missing in a lot of discussions is distributing the query processing too. By this I mean the actual computation that’s performed on the data.

I just had a conversation with Peter Zaitsev yesterday that helped make concrete some thoughts I’ve been having about Cassandra for a while. Because Cassandra doesn’t allow you to really do any computation in the data (aggregating, evaluating expressions, and so on), if you’re going to use it for truly Big data, you’re going to fetch enormous amounts of data across the network. Sure, you’re distributing the storage and retrieval across many machines — but you’re locating your data far

  [Read more...]
On Oracle NoSQL Database –Interview with Dave Segleau.
+0 Vote Up -0Vote Down
“We went down the path of building Oracle NoSQL database because of explicit request from some of our largest Oracle Berkeley DB installations that wanted to move away from maintaining home grown sharding implementations and very much wanted an out of box technology that can replicate the robustness of what they had built “out of [...]
What technologies are you running alongside MySQL?
+2 Vote Up -0Vote Down

In many environments MySQL is not the only technology used to store in-process data.

Quite frequently, especially with large-scale or complicated applications, we use MySQL alongside other technologies for certain tasks of reporting, caching as well as main data-store for portions of application.

What technologies for data storage and processing do you use alongside MySQL in your environment? Please feel free to elaborate in the comments about your use case and experiences!

Note: There is a poll embedded within this post, please visit the site to participate in this post's poll.

The post

  [Read more...]
On PostgreSQL. Interview with Tom Kincaid.
+0 Vote Up -1Vote Down
“Application designers need to start by thinking about what level of data integrity they need, rather than what they want, and then design their technology stack around that reality. Everyone would like a database that guarantees perfect availability, perfect consistency, instantaneous response times, and infinite throughput, but it´s not possible to create a product with [...]
MHA & MariaDB Cassandra Interoperability talks
+1 Vote Up -0Vote Down

Together with Sergei Petrunia, we talked about MariaDB Cassandra Interoperability. Sergei has done wonderful work here and I plan to showcase more Cassandra integration going forward. It’s worth noting that Elena came up with the benchmarks for this talk as well.

MariaDB Cassandra Interoperability from Colin Charles

Later, I gave a talk on MHA. Competing in a timeslot with

  [Read more...]
Introducing Data Fabric Design for Commodity SQL Databases
+3 Vote Up -0Vote Down
Extract from THE SCALE-OUT BLOG by Robert Hodges (CEO, Continuent)http://scale-out-blog.blogspot.com Data management is undergoing a revolution. Many businesses now depend on data sets that vastly exceed the capacity of DBMS servers. Applications operate 24x7 in complex cloud environments using small and relatively unreliable VMs. Managers need to act on new information from those systems in
Impressions of FOSDEM 13
+2 Vote Up -1Vote Down
FOSDEM 13 is now over, I am on my way home and I would like to share some thoughts sparkled by the intense atmosphere that I have lived in these two days in Brussels.

I have to admit, I was tempted to skip FOSDEM this year: the last 3 weeks have been crazily busy and I travelled a lot, leaving my family alone for too long. But now I am glad I made the effort.
For once, I will leave the comments around the MySQL talks to others, I am sure there will be posts on the matter. In this post I want to cover some of the non-MySQL talks and discussions I have attended and heard. Of course the topics that have been discussed are still relevant for MySQL and for the future of the MySQL ecosystem.
First of all, the number of attendees was impressive. The organisers claim there were more than 5,000 open source enthusiasts,



  [Read more...]
Impressions of FOSDEM 13
+0 Vote Up -0Vote Down
FOSDEM 13 is now over, I am on my way home and I would like to share some thoughts sparkled by the intense atmosphere that I have lived in these two days in Brussels. I have to admit, I was tempted to skip FOSDEM this year: the last 3 weeks have been crazily busy and I travelled a lot, leaving my family alone for too long. But now I am glad I made the effort.  

read more

The Data Day, A few days: January 2-4, 2013
+0 Vote Up -0Vote Down

Apache Cassandra and BigTop updates. And more

For 451 Research clients: Kx Systems aims to slipstream the ‘big data’ bandwagon to expanded kdb+ adoption bit.ly/VxkYlC

— Matt Aslett (@maslett) January 3, 2013

For 451 Research clients: IBM sheds light on ‘big data’ integration and governance the Big Blue way bit.ly/ZTAZcw By Krishna Roy

— Matt Aslett (@maslett) January 2, 2013

The Apache Software Foundation Announces Apache Cassandra v1.2 bit.ly/UFGKFN

— Matt Aslett (@maslett)

  [Read more...]
Announcing the Cassandra Storage Engine
+7 Vote Up -0Vote Down

We’re pleased to announce the first preview version of the Cassandra Storage Engine!

The Cassandra Storage Engine (SE) allows access to Cassandra databases from MariaDB/MySQL, and to provide data integration between the SQL and NoSQL worlds.

Have you ever needed to

  • grab some of Cassandra’s data from your web frontend, or SQL query?
  • insert a few records into Cassandra from some part of your app?

With Cassandra SE, this is easily possible. Cassandra SE makes Cassandra’s column families appear as MariaDB/MySQL tables that you can insert to, update, and select from. You can perform joins on Cassandra data, or againist data in Cassandra and data in MariaDB.

Today we’re releasing a source tarball, as

  [Read more...]
MariaDB Directions
+5 Vote Up -0Vote Down

Infor announced this week, that they will provide open source database alternatives to some of their products. MariaDB has been chosen, tested, and certified by Infor to be the open source database of choice (together with MySQL) for the Infor LN and ION products. Infor LN is Infor’s flagship ERP and is sometimes better known by its former name, Baan. It has 25 years of manufacturing know-how built into it and is used by more than 5,000 companies worldwide in a wide range of industries. These include automotive, industrial equipment and machinery, high tech and electronics, and aerospace and defense. This is a big stamp of approval that even the most critical systems can be run on MariaDB.

In other news, there are currently several really interesting paths coming together into some important

  [Read more...]
Exploring Configuration Management with Ansible
+0 Vote Up -0Vote Down

 

What is Ansible?

Ansible is a configuration management and deployment system, like Puppet, Capistrano, Fabric, and Chef. Its aim is to be radically simple and let you use your existing scripts to help with cluster configuration and software deployment whenever possible. Here are the ways that Ansible differentiates itself.

Simplicity

Ansible does not include a client/server architecture with pull-based clients (although in more recent versions, it does include pull-based configuration and deployment). Rather, it uses pre-existing network infrastructure: SSH. Every company has SSH installed on their cluster servers, and Ansible simply rides on top of this infrastructure to get the code and configuration out to the nodes.

Language

  [Read more...]
Setting up RAID0 in Ubuntu 12.04 in AWS High I/O
+0 Vote Up -0Vote Down

Amazon announced high I/O instances today. This is huge for anyone with a database larger than available memory, as it’s been a complete nightmare dealing with EBS up till now. Now your Cassandra, MongoDB, MySQL, or whatever your using should be able to perform well without requiring keeping your entire dataset in memory.

With each instance you get 2x1TB of disk. In this tutorial I’ll be setting it up as a RAID0 to get a single 2TB disk which should deliver excellent performance.

Before you get started, make sure you’ve got mdadm installed:

apt-get install mdadm

To begin, check fdisk and make sure your 1TB drives are mounted.

root@ip-10-140-128-232:~# fdisk -l

Disk /dev/xvda1: 8589 MB, 8589934592 bytes
255 heads, 63 sectors/track, 1044

  [Read more...]
Disaster recovery node on Ephemeral Cassandra cluster
+0 Vote Up -0Vote Down

Ephemeral Nodes in EC2 are good and bad... No guaranteed storage (like EBS, Elastic Block Storage), but you get guaranteed full disk bandwidth, which you can make even better if you RAID0 the disks.

Suppose you built the Cassandra cluster by making every node, but one, an ephemeral node...
And then you set up ONE node, as EBS backed up node (with unpredictable or relatively bad performance).
Then you set up that node to be the seed node for all other nodes, which makes schema management even easier.

On all ephemeral nodes, set up Snitch (in cassandra.yaml) as:

read more

Fast Cassandra nodes in AWS EC2
+0 Vote Up -0Vote Down

Build a Cassandra cluster out of 3+ m1.large nodes, using ephemeral storage...

Once you start building a node with ephemeral storage, it makes no longer sese to do RAID1 or such -- any hickup, and it's all going to be blown away anyway...

The trick is to build a seed-node with EBS-storage and change the Snitch -- look for my posting on how to do that.

This ec2 incantation shows you, how to get the two ephemeral drives as sdb and sdc:
ec2-run-instances ami-5139f538 -t m1.large -g sg-b5eff2d9 -s subnet-d5c7fdbc -b /dev/sdb=ephemeral0 -b /dev/sdc=ephemeral1

read more

Open APIs are the new open source
+1 Vote Up -0Vote Down

We’ve seen the rise of open source software in the enterprise and also beyond the IT industry, but the real keys to openness and its advantages in today’s technology world — where efficient use of cloud computing and supporting services are paramount — exist in open application programming interfaces, or APIs.

Open source software continues to be a critical part of software development, systems administration, IT operations and more, but much of the action in leveraging modern cloud computing and services-based infrastructures centers on APIs. Open APIs are the new open source.

Read the full story at LinuxInsider.

Vote for MySQL[plus] awards 2011 !
+5 Vote Up -0Vote Down

First of all, I wish you a happy new year.
Many things happened last year, it was really exciting to be involved in the MySQL ecosystem.
I hope this enthusiasm will be increased this year, up to you !

To start the year, I propose the MySQL[plus] Awards 2011
It will only take 5 minutes to fill out these polls.
Answer with your heart first and then with your experience with some of these tools or services.

Polls will be closed January 31, so, vote now !
For “other” answers, please,  let me a comment with details.

Don’t hesitate to submit proposal for tools or services in the comments.






  [Read more...]
451 CAOS Links 2011.10.18
+0 Vote Up -0Vote Down

DOCOMO adopts, invests in Couchbase. Apache Cassandra reaches 1.0. And more.

# DOCOMO Innovations adopted Couchbase as DOCOMO Capital invested in the NoSQL database vendor.

# The Apache Software Foundation announced Apache Cassandra v1.0.

# Nuxeo announced the availability of Nuxeo Cloud.

# SGI formed a distribution relationship with Cloudera and

  [Read more...]
Measuring the scalability of SQL and NoSQL systems.
+0 Vote Up -0Vote Down
“Our experience from PNUTS also tells that these systems are hard to build: performance, but also scaleout, elasticity, failure handling, replication. You can’t afford to take any of these for granted when choosing a system. We wanted to find a way to call these out.” – Adam Silberstein and Raghu Ramakrishnan, Yahoo! Research. ___________________________________ A [...]
Open Database camp 2011 - Travel logistics, and don't forget the party
+6 Vote Up -0Vote Down
The Open Database Camp 2011 is near. In 9 days, the welcome party starts, and then the conference itself gets going.
If you are coming earlier than Friday, May 6th, you can either use public transportation or book a private seat with a volunteer in the car pooling page. Please help the organizers: post your arrival and departure dates and times, so we may be able to help you even outside the official conference days.
About the conference itself, as everyone should know, it's a

  [Read more...]
451 CAOS Links 2011.03.25
+0 Vote Up -0Vote Down

Red Hat grows revenue 20%+. Google withholding Honeycomb source code. And more.

Follow 451 CAOS Links live @caostheory on Twitter and Identi.ca, and daily at Paper.li/caostheory
“Tracking the open source news wires, so you don’t have to.”

# Red Hat reported Q4 revenue up 25% to $245m, FY revenue up 22% to $909m

# Google is withholding the source code to Honeycomb for the foreseeable future.

# Rick Clark explained why he left Rackspace amid concerns that the company is exerting too much control over OpenStack.

# DataStax launched Brisk, a Hadoop/Hive


  [Read more...]
451 CAOS Links 2011.01.18
+0 Vote Up -0Vote Down

Funding for OpenGamma. Riptano becomes OpenStax. And more.

Follow 451 CAOS Links live @caostheory on Twitter and Identi.ca, and daily at Paper.li/caostheory
“Tracking the open source news wires, so you don’t have to.”

# OpenGamma raised $6m series B funding.

# Apache Cassandra-supporter Riptano changed its name to DataStax and has added 50 customers in 6 months.

# WANdisco acquired the SVNForum.org Subversion user community.

# Univa hired the principal engineers from the Grid Engine team, will publish a Univa version of Grid Engine before the end of Q1.


  [Read more...]
How Real is the Data Deluge?
+1 Vote Up -1Vote Down

It seems obvious that given the decreasing cost of storage and computation, there's going to be a significant increase in the volume of data that organizations accumulate over the next 10 years.  But the type of data being accumulated may be different from the areas where traditional DBMSs dominated.  It's not just about transactions; it's search patterns, on-line behavior, click-thru data, events fired off by smartphones, messages over Twitter & Facebook, log data of various kinds.

If an organization can figure out a better way identify prospects, or deliver more targeted ads, or optimize pricing decisions by analyzing terrabytes of data, they'd be crazy not to. Over the long term, companies

  [Read more...]
451 CAOS Links 2010.10.08
+0 Vote Up -0Vote Down

Patents! Patents! Patents! Canonical’s perfect 10. And more.

Follow 451 CAOS Links live @caostheory on Twitter and Identi.ca, and daily at Paper.li/caostheory
“Tracking the open source news wires, so you don’t have to.”

# Google responded to Oracle’s claims that its Android OS infringes copyrights and patents related to Java.

# Matt Asay evaluated the various patent claims against Android and its related devices.

# Microsoft licensed smartphone patents from ACCESS Co and a subsidiary of Acacia Research.

# Glyn Moody assessed what Microsoft’s


  [Read more...]
LCA Miniconf Call for Papers: Data Storage: Databases, Filesystems, Cloud Storage, SQL and NoSQL
+3 Vote Up -0Vote Down

This miniconf aims to cover many of the current methods of data storage and retrieval and attempt to bring order to the universe. We’re aiming to cover what various systems do, what the latest developments are and what you should use for various applications.

We aim for talks from developers of and developers using the software in question.

Aiming for some combination of: PostgreSQL, Drizzle, MySQL, XFS, ext[34], Swift (open source cloud storage, part of OpenStack), memcached, TokyoCabinet, TDB/CTDB, CouchDB, MongoDB, Cassandra, HBase….. and more!

Call for Papers open NOW (Until 22nd October).

Do We Need a New Programming Language for Big Data?
+0 Vote Up -0Vote Down


 

I'm the boards of two companies (Pentaho, Revolution Analytics) that are starting to see a lot of customer traction around Big Data. More and more companies in media, pharma, retail and finance are doing advanced analysis, reporting, graphing, etc with massive data sets. It made me wonder what other areas of the technology stack might evolve with the trend towards Big Data.  Obviously, there's new middleware layers like Hadoop and Map Reduce, and we're also seeing the emergence of


  [Read more...]
Digg’s main competitor (Reddit) runs Cassandra but their VP of Engineering was fired for the decision to switch.
+3 Vote Up -0Vote Down

Apparently, Digg performed a big migration from MySQL to Cassandra and a big migration to their new Digg v4 architecture and now their VP of Engineering has been shown the door:

Ever since Digg launched its new site design, it’s been plagued with all kinds of trouble, not least of which is that it keeps going down. The problems with the new architecture are so bad that VP of Engineering John Quinn is now gone, we’ve confirmed with sources close to Digg.

In a Diggnation video today, CEO Kevin Rose explained some of the technical issues the site is dealing with and why it can’t simply roll back to the previous architecture. The new version of Digg, v4, is based on a distributed database called

  [Read more...]
Cassandra and Ganglia
+1 Vote Up -0Vote Down


I finally got some time to do some house cleaning. One of my nagging low-hanging fruit jobs was to stop using jconsole as my monitor. I created a ganglia script to graph what is above. The image illustrated above I am showing all the Cassandra servers and their total row read stages completed in the last hour as a gauge. In essence I am graphing the delta of the change between ganglia script runs.

How I have it set up is:

All data exposed by JMX to produce tpstats and cfstats is graphed via ganglia. The pattern for each graph is as follows

cass_{stat_class}_{key}

stat_class - tpc, tpp, tpa means complete, pending, active respectively
key - would be message deserialization for instance.










  [Read more...]
Showing entries 1 to 30 of 45 Next 15 Older Entries

Planet MySQL © 1995, 2014, Oracle Corporation and/or its affiliates   Legal Policies | Your Privacy Rights | Terms of Use

Content reproduced on this site is the property of the respective copyright holders. It is not reviewed in advance by Oracle and does not necessarily represent the opinion of Oracle or any other party.