Planet MySQL

Displaying posts with tag: vertica (reset)

Aug

2018

Databook: Turning Big Data into Knowledge with Metadata at Uber

Posted by Uber Engineering on Fri 03 Aug 2018 15:30 UTC
Tags:

postgres, Infrastructure, metadata, Architecture, data warehouse, Data Management, vertica, cassandra, quartz, Hive, MySQL, hdfs, Kafka, gradle, Uber, Uber Data, Data Storage, Databook, Dropwizard, Queryparser, RESTful API, Uber Data Knowledge, Uber Engineering

From driver and rider locations and destinations, to restaurant orders and payment transactions, every interaction on Uber’s transportation platform is driven by data. Data powers Uber’s global marketplace, enabling more reliable and seamless user experiences across our products for riders, …

The post Databook: Turning Big Data into Knowledge with Metadata at Uber appeared first on Uber Engineering Blog.

May

2018

Comparing MySQL to Vertica Replication under MemCloud, AWS and Bare Metal

Posted by MC Brown on Thu 24 May 2018 18:23 UTC
Tags:

Articles, Commentary, vertica, MySQL, tungsten-replicator, memcloud

Back in December, I did a detailed analysis for getting data into Vertica from MySQL using Tungsten Replicator, all within the Kodiak MemCloud.

I got some good numbers towards the end – 1.9 million rows/minute into Vertica. I did this using a standard replicator deployment, plus some tweaks to the Vertica environment. In particular:

Integer hash for a partition for both the staging and base tables
Some tweaks to the queries to ensure that we used the partitions in the most efficient manner
Optimized the batching within the applier to hit the right numbers for the transaction counts

That last one is a bit of a cheat because in a real-world situation it’s much harder to be able to identify those transaction sizes and row counts, but for testing, we’re trying to get the best performance!

Next what I wanted to do was set up some bare metal and AWS servers that were of an …

[Read more]

Dec

2017

Analytical Replication Performance from MySQL to Vertica on MemCloud

Posted by MC Brown on Thu 14 Dec 2017 13:43 UTC
Tags:

Articles, vertica, MySQL, tungsten-replicator, memcloud

I’ve recently been trying to improve the performance of the Vertica replicator, particularly in the form of the of the new single schema replication. We’ve done a lot in the new Tungsten Replicator 5.3.0 release to improve (and ultimately support) the new single schema model.

As part of that, I’ve also been personally looking to Kodiak MemCloud as a deployment platform. The people at Kodiak have been really helpful (disclaimer: I’ve worked with some of them in the past). MemCloud is a high-performance cloud platform that is based on hardware with high speed (and volume) RAM, SSD and fast Ethernet connections. This means that even without any adjustment and tuning you’ve got a fast platform to work on.

However, if you are willing to put in some extra time, you can tune things further. Once you have a super quick …

[Read more]

Dec

2017

Analytical Replication Performance from MySQL to Vertica on MemCloud

Posted by MC Brown on Thu 14 Dec 2017 13:43 UTC
Tags:

Articles, vertica, MySQL, tungsten-replicator, memcloud

However, if you are willing to put in some extra time, you can tune things further. Once you have a super quick …

[Read more]

Oct

2017

Continuent Road Map: One year after restart… Where next?

Posted by Petri Virsunen of Continuent on Thu 19 Oct 2017 19:23 UTC
Tags:

Oracle, continuent, tungsten, vertica, cassandra, MySQL, elasticsearch, Kafka, #redshift

You may know Continuent Tungsten for our highly advanced MySQL replication tool, Tungsten Replicator, and for our state-of-the-art MySQL clustering solution, Tungsten Clustering. Our solutions are used by leading SaaS vendors, e-commerce, financial services and telco customers.

But there are more, many more, Tungsten deployments out there. Tungsten Replicator can be used for real-time data

Jun

2017

On Apache Ignite, Apache Spark and MySQL. Interview with Nikita Ivanov

Posted by Roberto V. Zicari on Fri 30 Jun 2017 13:40 UTC
Tags:

Uncategorized, sql, memcached, data warehousing, analytics, hadoop, mysq, Gridgain, SaaS, big data, vertica, redis, internet of things, machine learning, Tableau, Apache Ignite, Nikita Ivanov, proxysql, Apache Spark, vitess, ClickHouse, Apache Ignite In-Memory SQL Grid, Apache Kafka, ETL processes, in-memory computing, in-memory data grids, Spark Streaming

“Spark and Ignite can complement each other very well. Ignite can provide shared storage for Spark so state can be passed from one Spark application or job to another. Ignite can also be used to provide distributed SQL with indexing that accelerates Spark SQL by up to 1,000x.”–Nikita Ivanov.

I have interviewed Nikita Ivanov,CTO of GridGain.
Main topics of the interview are Apache Ignite, Apache Spark and MySQL, and how well they perform on big data analytics.

RVZ

Q1. What are the main technical challenges of SaaS development projects?

Nikita Ivanov: SaaS requires that the applications be highly responsive, reliable and web-scale. SaaS development projects face many of the same challenges as …

[Read more]

Jul

2015

Replication in real-time from Oracle and MySQL into data warehouses and analytics

Posted by Petri Virsunen of Continuent on Thu 23 Jul 2015 21:00 UTC
Tags:

Oracle, data warehouse, mysql replication, big data, vertica, mapr, MySQL, HortonWorks, Apache Hadoop, Data Analytics, database replication, Amazon Redshift, HP Vertica

Practical tips and a live demo of how to get your data warehouse loading projects off the ground quickly and efficiently when replicating from MySQL and Oracle into Amazon Redshift, HP Vertica and Hadoop.

Webinar-on-demand. Recorded 07/23/15.

Oct

2014

An Ending and a Beginning: VMware Has Acquired Continuent

Posted by Robert Hodges on Wed 29 Oct 2014 15:00 UTC
Tags:

postgresql, Oracle, cloud computing, hadoop, tungsten, SaaS, big data, mariadb, NoSQL, vertica, MySQL, Data fabric

As of today, Continuent is part of VMware. We are absolutely over the moon about it.

You can read more about the news on the VMware vCloud blog by Ajay Patel, our new boss. There’s also an official post on our Continuent company blog. In a nutshell the Continuent team is joining the VMware Cloud Services Division. We will continue to improve, sell, and support our Tungsten products and work on innovative integration into VMware’s product line.

So why do I feel exhilarated about joining VMware? There are three reasons.

1. Continuent is joining a world-class company that is the leader in virtualization and cloud infrastructure solutions. Even …

[Read more]

Sep

2014

Replicating from MySQL to Amazon Redshift

Posted by Petri Virsunen of Continuent on Fri 05 Sep 2014 01:00 UTC
Tags:

Oracle, amazon, hadoop, data warehouse, mysql replication, vertica, MySQL, database replication, #mysql, redshift

Continuent is delighted to announce an exciting Continuent Tungsten feature addition for MySQL users: replication in real-time from MySQL into Amazon RedShift.

In this webinar-on-demand we survey Continuent Tungsten capabilities for data warehouse loading, then zero in on practical details of setting up replication from MySQL into RedShift. We cover:

Introduction to real-time movement

May

2014

Webinar-on-Demand: Set Up & Operate Open Source Oracle Replication

Posted by Petri Virsunen of Continuent on Fri 30 May 2014 18:41 UTC
Tags:

Oracle, hadoop, vertica, MySQL, Continuent Tungsten, Continuent Tungsten Replicator

Oracle's expensive and complex replication makes it difficult to build cost-effective applications that move data in real-time to data warehouses (Oracle, Hadoop, Vertica) and popular databases like MySQL. Fortunately, Continuent Tungsten offers a solution.In this virtual course, you will learn how Continuent Tungsten solves problems with Oracle replication at a fraction of the cost of other

Top Authors

Oracle MySQL Blogs

Vendor Blogs

MySQL Links