Home |  MySQL Buzz |  FAQ |  Feeds |  Submit your blog feed |  Feedback |  Archive |  Aggregate feed RSS 2.0 English Deutsch Español Français Italiano 日本語 Русский Português 中文
Showing entries 1 to 28

Displaying posts with tag: data warehouse (reset)

theCube @ Hadoop Summit 2014 - Robert Hodges (Continuent) with John Furrier and Jeff Kelly on on real-time data loading from Oracle and MySQL into Hadoop.
+0 Vote Up -0Vote Down
The Hadoop Summit, a leading Apache Hadoop industry conference, has grown significantly over the years, and throughout the day, theCUBE, led by hosts John Furrier and Jeff Kelly, featured the best of thought leaders, use cases, data scientists, data analysts, and developers at the event. Watch yesterday's interview with Robert Hodges (CEO, Continuent) on real-time data loading from Oracle and
Continuent Delivers Real-Time Data to Cloudera | Business Wire
+0 Vote Up -0Vote Down
SAN JOSE, CA– May 6, 2014 – Continuent, Inc., a leading provider of open source database clustering and replication solutions, today announced that their recently announced Tungsten Replicator 3.0 solution has been certified by Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop™. Continuent Tungsten Replicator 3.0 enables organizations to quickly and easily 
Setup & operate Tungsten webinar series
+1 Vote Up -0Vote Down
Don't miss your opportunity to learn about Continuent Tungsten via our free "Setup & Operate" webcast series. These free webcasts include live presentations and interactive Q&A.Webcast OverviewsSetup & Operate Tungsten ReplicatorMay 15th, 10:00 am PDTTungsten Replicator is an innovative and reliable tool that can solve your most complex replication problems. We will introduce Replicator
Replicate from Oracle to Oracle, Oracle to MySQL, and Oracle to Analytics
+0 Vote Up -0Vote Down
Oracle is the most powerful DBMS in the world. However, Oracle's expensive and complex replication makes it difficult to build highly available applications or move data in real-time to data warehouses and popular databases like MySQL. In this webinar-on-demand you will learn how Continuent Tungsten solves problems with Oracle replication at a fraction of the cost of other solutions and with less
Tungsten University: Load a Vertica data warehouse with MySQL data
+0 Vote Up -0Vote Down
  Continuent Tungsten offers real-time replication from MySQL to a variety of DBMS types including Vertica. In this Tungsten University webcast we will show you the details of setting up MySQL-to-Vertica replication, including the following topics: Introduction to Continuent Tungsten features for data warehouse loading Installation for MySQL to Vertica replication Best practices for
Exploring SAP HANA – Powering Next Generation Analytics
+0 Vote Up -0Vote Down
SAP HANA , having entered the data 2.0/3.0 space at the right time, has been getting traction lately; and there will be lot of users like me who wants to[...]
Data Science vs. Data Analytics
+1 Vote Up -0Vote Down
As this topic came up a few times this week for discussion at various places, I thought of composing a post on “Data Scientist vs. Data Analytics Engineer”; even though[...]
Typical “Big” Data Architecture
+1 Vote Up -0Vote Down
Here is the typical “Big” data architecture, that covers most components involved in the data pipeline. More or less, we have the same architecture in production in number of places[...]
Real-time Replication Between Oracle and Oracle, and Oracle and MySQL
+0 Vote Up -0Vote Down
Re Oracle is the most powerful database system in the world. However, Oracle's expensive and complex replication makes it difficult to build highly available applications or move data in real-time to data warehouses and popular databases like MySQL.  In this video (recording of our live webcast on 10/18/12) you will learn how Continuent Tungsten solves problems with Oracle replication at a
Webinar Thu 10/18: Real-time Replication Between Oracle and Oracle, and Oracle and MySQL
+0 Vote Up -0Vote Down
Oracle is the most powerful database system in the world. However, Oracle's expensive and complex replication makes it difficult to build highly available applications or move data in real-time to data warehouses and popular databases like MySQL. In this webinar you will learn how Continuent Tungsten solves problems with Oracle replication at a fraction of the cost of other solutions and with
Oracle High Availability and More with Continuent Tungsten
+0 Vote Up -0Vote Down
Oracle is the most powerful database system in the world. However, Oracle's expensive and complex replication makes it difficult to build highly available applications or move data in real-time to data warehouses and popular databases like MySQL. In this video (recording of our 9/13/12 webinar) you will learn how Continuent Tungsten solves problems with Oracle replication at a fraction of the
So now Hadoop's days are numbered?
+3 Vote Up -0Vote Down
Earlier this week we all read GigaOM's article with this title:
"Why the days are numbered for Hadoop as we know it"
I know GigaOM like to provoke scandals sometimes, we all remember some other unforgettable piece, but there is something behind it...

Hadoop today (after SOA not so long ago) is one of the worst case of an abused buzzword ever known to men. It's everything, everywhere, can cure illnesses and do "big-data" at the same time! Wow! Actually Hadoop is a software framework that supports data-intensive distributed applications, derived from Google's MapReduce and Google File System (GFS) papers.

My take from the article is




  [Read more...]
Scale differences between OLTP and Analytics
+1 Vote Up -0Vote Down

In my previous post,http://database-scalability.blogspot.com/2012/05/oltp-vs-analytics.html, I reviewed the differences between OLTP and Analytics databases.

Scale challenges are different between those 2 worlds of databases.



Scale challenges in the Analytics world are with the growing amounts of data. Most solutions have been leveraging those 3 main aspects: Columnar storage, RAM and parallelism.
Columnar storage makes scans and data filtering more precise and focused. After that – it all goes down to







  [Read more...]
Can’t Travel to Collaborate 12? Plug-in Virtually Instead! (revised schedule)
+0 Vote Up -0Vote Down
  Plug-in to Vegas The program focuses on key topics such as high availability, virtualization, security, business intelligence, Exadata, Cloud Computing and internals.  Recently added, we switched around the schedule to include the Thursday Deep Dive, Avoiding Downtime through the Maximum … Continue reading →
How analysing your binlogs can be quite informative
+1 Vote Up -0Vote Down

If you have used MySQL for some time you know that mysqld can write binlogs. This is usually used for backup purposes and JITR or for replication purposes so a slave can collect the changes made on the master and apply them locally.

Most of the time apart from configuring how long you keep these binlogs they are pretty much ignored.

Recently I came across an issue.  I have a slave server which is NOT configured read only and which has an additional database used to collect statistics from the replicated database and provided aggregation and business information. The typical sales per country, per product, per day, week, month, year, per whatever etc.  This is the usual datawarehouse type functionality.  It’s done on a slave and not the

  [Read more...]
Why Kickfire is a fail in MySQL Data warehouse
+4 Vote Up -1Vote Down
Even though Data warehouse is picking very rapidly in the last year or so, but few companies who are already made a right mark in the right time could not[...]
Data Warehousing Best Practices: Comparing Oracle to MySQL pt 2
+5 Vote Up -3Vote Down

At Kscope this year, I attended a half day in-depth session entitled Data Warehousing Performance Best Practices, given by Maria Colgan of Oracle. My impression, which was confirmed by folks in the Oracle world, is that she knows her way around the Oracle optimizer.

See part 1 for the introduction and talking about power and hardware. This part will go over the 2nd “P”, partitioning. Learning about Oracle’s partitioning has gotten me more interested in how MySQL’s partitioning works, and

  [Read more...]
Data Warehousing Best Practices: Comparing Oracle to MySQL pt 1
+4 Vote Up -3Vote Down

At Kscope this year, I attended a half day in-depth session entitled Data Warehousing Performance Best Practices, given by Maria Colgan of Oracle. My impression, which was confirmed by folks in the Oracle world, is that she knows her way around the Oracle optimizer.

These are my notes from the session, which include comparisons of how Oracle works (which Maria gave) and how MySQL works (which I researched to figure out the difference, which is why this blog post took a month after the conference to write). Note that I am not an expert on data warehousing in either Oracle or MySQL, so these are more concepts to think about than

  [Read more...]
Two webinars this week
+0 Vote Up -0Vote Down

I’m taking part in two webinar’s this week that will likely be of interest to CAOS readers. On WednesdayI’m contributing to a webinar with EnterpriseDB on the subject of open source database adoption in the enterprise, while on Thursday I’ll be presenting a 451 Group webinar on data warehousing.

During the EnterpriseDB webinar we will provide recommendations for how organizations can effectively leverage open source software. Attendees will learn about open source software trends for 2010, top considerations when using open source databases, and best practices for successful deployments of open source software.

I’ll be providing some data points from our recent surveys on database adoption and

  [Read more...]
High-Performance, Affordable, Open Data Marts
+0 Vote Up -0Vote Down

Departmental or subject-specific data warehouses – known as “data marts” in the industry – seem to be gaining in popularity.  Fueled partly by companies wanting to start small with focused projects in today’s economy, and partly by advances in data warehousing technology improving affordability and deployability, data marts seem to be popping-up everywhere.

In most cases, data mart projects are driven by the head of a business unit or a functional group (like Sales) needing to analyze their own slice of data in order to run their department more efficiently and effectively.  The data may come directly from an operational system or a combination of source systems resulting in what’s called an “independent data mart”, or it may come directly from a larger, enterprise data warehouse in a hub-and-spoke or “dependent data mart” configuration.

  [Read more...]
High-Performance, Affordable, Open Data Marts
+0 Vote Up -0Vote Down
Departmental or subject-specific data warehouses - known as "data marts" in the industry - seem to be gaining in popularity.  Fueled partly by companies wanting to start small with focused projects in today's economy, and partly by advances in data warehousing technology improving affordability and deployability, data marts seem to be popping-up everywhere. In most cases, data mart projects are driven by the head of a business unit or a functional group (like Sales) needing to analyze their own slice of data in order to run their department more efficiently and effectively.  The data may come directly from an operational system or a combination of source systems resulting in what's called an "independent data mart", or it may come directly from a larger, enterprise data warehouse in a hub-and-spoke or "dependent data mart" configuration. In either case today, according to industry analysts, companies  [Read more...]
Data Warehouse/Analytic Appliances – What to Consider
+0 Vote Up -0Vote Down

Why was Teradata able to become the leader of data warehousing at the super high-end (e.g. greater than 25 TB’s)?  Why was Netezza only the second pure-play data warehousing company to go public by focusing on the 10 – 25 TB range of opportunities?  Why did Oracle after so many years of denial finally announce a joint hardware / software product for data warehousing with HP, the Exadata data warehouse server?  Why did Microsoft acquire DATAllegro, one of the earlier data warehousing appliances? Why are there now dozens of data warehouse appliances available on the market today, and – more importantly – how should a customer choose which one to purchase? 

In all these cases, the vendors have listened to the market and concluded that the most optimal way to serve the customer is through a true data warehouse appliance.  Given that there are so many flavors of appliances, though,

  [Read more...]
Kickfire Launches MySQL Appliance for Data Warehousing Mass Market
+0 Vote Up -0Vote Down

The Kickfire MySQL Appliance is offically launched!

We just announced today, along with a new customer, and strategic partnerships with ten leading service companies including Percona, the MySQL performance experts.

Look for more news next week from Kickfire as we head into the MySQL conference. Kickfire will also give a keynote on the first day of the conference and will make a surprise announcement! Stay tuned …

Real Time Data Warehousing Presentation and Video
+0 Vote Up -0Vote Down

At the March Boston MySQL User Group meeting, Jacob Nikom of MIT’s Lincoln Laboratory presented “Optimizing Concurrent Storage and Retrieval Operations for Real-Time Surveillance Applications.” In the middle of the talk, Jacob said he sometimes calls what he did in this application as “real-time data warehousing”, which was so accurate I decided to give that title to this blog post.

The slides can be downloaded in PDF format (1.3 Mb) at http://www.technocation.org/files/doc/Concurrent_database_performance_02.pdf. The 54 minute video can be downloaded (644Mb) at http://technocation.org/node/693/download or streamed directly in your browser at

  [Read more...]
Kickfire Ships to First Web 2.0 Customer
+0 Vote Up -0Vote Down

We just shipped and installed the Kickfire appliance in the data center of our first web 2.0 customer this week. We’re very excited about this new customer. With already over a million active members, this company continues to grow in spite of a challenging economic environment because it has a clearly defined audience and a business model which adds value to its members while adding money to its coffers. Part of the value add to their member base comes from well-targeted discount and coupon offers. In order to achieve this, the company runs complex analytics to understand members’ behaviors and responses and uses this data to help its advertising customers better target their offers.

As with many web 2.0 companies, this customer has built its application on MySQL. MySQL has helped them scale their web application well but was presenting performance and scalability challenges for their

  [Read more...]
KickFire is Back
+0 Vote Up -0Vote Down
After receiving an email about talking with Robert David, Director of Sales at KickFire, I checked out recent news on KickFire, as there had been little written about this company aside from the big splash they made at the MySQL Users Conference back in April 2008.And, lo and behold, there was a piece of news, posted on October 14 - Kickfire Enters Into MySQL Enterprise Agreement With Sun
Building a data warehouse on a budget with MySQL 5.1
+0 Vote Up -0Vote Down

If there is one thing that a DBA or data warehouse architect can count on, it is that data volumes will increase while budgets will decrease.

This is why MySQL 5.1 and its partitioning capabilities are so interesting. I’m going to demonstrate how you can build a small/medium-sized data warehouse or data mart (1-10 TB range) on a shoe-string budget.

the mission

I decided to convert a relatively large statistics table (750m rows, 140GB in size in about 10 partitions) on a test machine from MyISAM to the Archive storage engine. After a long conversion process, my data, on disk, ended up being about 21GB, for an impressive compression ratio of 6.7:1.

Prior to MySQL 5.1, one of the drawbacks to the archive storage engine was that you could not index it; however, with partition pruning, you can get yourself a “free” index on a large archive table by splitting it into



  [Read more...]
Data Warehousing 101: The purpose of a data warehouse
+0 Vote Up -0Vote Down
When your company decides that "it is time to build a data warehouse", what thoughts come to mind?1) A magical fairy ice cream land where data is presented in chocolate shells for everyone to digest perfectly;2) A big literal warehouse in the industrial section of town with rusty old containers;3) Another place to put data, which means another place for you to track and monitor additional
Showing entries 1 to 28

Planet MySQL © 1995, 2014, Oracle Corporation and/or its affiliates   Legal Policies | Your Privacy Rights | Terms of Use

Content reproduced on this site is the property of the respective copyright holders. It is not reviewed in advance by Oracle and does not necessarily represent the opinion of Oracle or any other party.