Planet MySQL

Displaying posts with tag: ETL

Mar

2025

Oracle Technology Roundtable for Digital Natives – Let’s have a look at AI, Cloud and HeatWave

Posted by dbi services on Fri 07 Mar 2025 08:16 UTC
Tags:

Oracle, Security, cloud, olap, oltp, ETL, data, innovation, business intelligence, big data, performances, MySQL, Development & Performance, ai, OCI, ML, Heatwave, Cloud Native, Lakehouse, GenAI, machinelearning, objectstorage, objectstore, vectors

Yesterday I participated in the Oracle Technology Roundtable for Digital Natives in Zurich.

It was a good opportunity to learn more about AI, Cloud and HeatWave, with a focus on the trendiest features of this product: generative AI, machine learning, vector processing, analytics and transaction processing across data in Data Lake and MySQL databases.

It was also great to share moments with the Oracle and MySQL teams and to meet customers who gave feedback and tips about the solutions they already have in place in this area.

I’ll try to summarize below some key takeaways from each session.

Unlocking Innovation: How Oracle AI is Shaping the Future of Business (by Jürgen Wirtgen)

AI is not a new topic. But how do we …

[Read more]

Mar

2018

On RDBMS, NoSQL and NewSQL databases. Interview with John Ryan

Posted by Roberto V. Zicari on Fri 09 Mar 2018 11:05 UTC
Tags:

Oracle, Open Source, Uncategorized, Databases, ibm, interview, amazon, RDBMS, ETL, data warehouse, big data, NoSQL, mongodb, cassandra, redis, voltdb, newsql, MySQL, storm, MemSQL, Amazon Redshift, CockroachDB, Spark Streaming, Michael Stonebraker, Flink, Google Big Query, John Ryan, Lambda Architecture, Microsoft and, Snowflake, UBS

“The single most important lesson I’ve learned is to keep it simple. I find designers sometimes deliver over-complex, generic solutions that could (in theory) do anything, but in reality are remarkably difficult to operate, and often misunderstood.”–John Ryan

I have interviewed John Ryan, Data Warehouse Solution Architect (Director) at UBS.

RVZ

Q1. You are an experienced Data Warehouse architect, designer and developer. What are the main lessons you have learned in your career?

John Ryan: The single most important lesson I’ve learned is to keep it simple. I find designers sometimes deliver over-complex, generic solutions that could (in theory) do anything, but in reality are remarkably difficult to operate, and often misunderstood. I believe this stems from a lack of understanding of the …

[Read more]

Feb

2018

Incremental MYSQL loads to BigQuery using Matillion

Posted by Searce Engineering on Thu 15 Feb 2018 12:01 UTC
Tags:

ETL, data, MySQL, bigquery, google-cloud-platform

As part of building an enterprise DW for one of our customers, we had to sync a bunch of tables from a MySQL slave to BigQuery at 30-minute intervals. Considering the range of other non-relational data sources which will be part of this load, we chose Matillion as the ETL tool. Matillion is easy to set up (just provision the VM and start authoring jobs) and has a long list of integrations, so it made sense.

This post explains building a Matillion job that does the following:

Full Load
Incremental load for tables with a larger row count and an ID that can be looked up to find new rows since the last load.
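
The incremental case can be sketched roughly as follows. This is a minimal illustration of the general "highest ID seen so far" pattern, not the Matillion job the post builds: in-memory SQLite databases stand in for both the MySQL source and the BigQuery target, and the `orders` table, its columns, and the `incremental_load` and `demo` helpers are hypothetical names chosen for the example.

```python
import sqlite3


def incremental_load(source, target, table, id_column="id"):
    """Copy rows whose ID is greater than the highest ID already in the target."""
    # High-water mark: largest ID loaded so far (0 when the target is empty).
    last_id = target.execute(
        f"SELECT COALESCE(MAX({id_column}), 0) FROM {table}"
    ).fetchone()[0]

    # Fetch only the rows added to the source since the last load.
    new_rows = source.execute(
        f"SELECT * FROM {table} WHERE {id_column} > ? ORDER BY {id_column}",
        (last_id,),
    ).fetchall()

    if new_rows:
        placeholders = ", ".join("?" * len(new_rows[0]))
        target.executemany(
            f"INSERT INTO {table} VALUES ({placeholders})", new_rows
        )
        target.commit()
    return len(new_rows)


def demo():
    # Assumption: SQLite stands in for the real source and target systems.
    source = sqlite3.connect(":memory:")
    target = sqlite3.connect(":memory:")
    for conn in (source, target):
        conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, amount REAL)")

    source.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 9.5), (2, 20.0)])
    first = incremental_load(source, target, "orders")   # target empty: copies everything

    source.execute("INSERT INTO orders VALUES (3, 7.25)")
    second = incremental_load(source, target, "orders")  # copies only the new row
    third = incremental_load(source, target, "orders")   # nothing new to copy
    return first, second, third
```

Note that this pattern only picks up newly inserted rows. Rows that are updated in place keep their ID and would need a different change marker, such as a last-modified timestamp.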

MYSQL Drivers

If you came from a Google search looking for Matillion, I am assuming you are done with provisioning the instance, setting up the default project, etc., so I am skipping those. While Matillion ships with PostgreSQL drivers, for some reason it doesn’t have MySQL …

[Read more]

Jan

2017

What products & improvements are new on AWS?

Posted by Sean Hull on Sat 07 Jan 2017 03:45 UTC
Tags:

ETL, analytics, cloud computing, BI, aws, Database Management, mariadb, NoSQL, RDS, MySQL, Database Operations, redshift, athena, elt, quicksight

Amazon is releasing new products & services to its global cloud compute network at a rate that has all of our heads spinning. Join 32,000 others and follow Sean Hull on Twitter @hullsean. Here’s new stuff worth mentioning around databases & data. 1. For ETL – AWS Glue Moving data from your transactional MySQL or …

Dec

2015

Using JSON’s Arrays for MariaDB Dynamic Columns

Posted by Serge Frezefond on Fri 04 Dec 2015 14:04 UTC
Tags:

community, Development, ETL, analytics, BI, mariadb, json, MySQL

The JSON format includes the concept of an array. A JSON object can contain an attribute of array type. We have seen that we can use the UDFs (user-defined functions) provided by the MariaDB CONNECT Storage Engine to implement dynamic columns. Let us create a table with a text column containing a JSON string and let ...

Nov

2015

MariaDB CONNECT Storage Engine JSON Autodiscovery

Posted by Serge Frezefond on Tue 24 Nov 2015 13:52 UTC
Tags:

community, ETL, mariadb, MySQL

The MariaDB CONNECT storage engine offers access to JSON files and allows you to see an external JSON file as a MariaDB table. A nice feature of the CONNECT storage engine is its capability to auto-discover a table structure when the table corresponds to external data. In our case the CONNECT storage engine will automatically [...]

Jun

2015

Log Buffer #429: A Carnival of the Vanities for DBAs

Posted by The Pythian Group on Fri 26 Jun 2015 12:47 UTC
Tags:

Oracle, innodb, Log Buffer, DBA, Pythian, slave, SQL Server, ETL, alter, mutex, Azure, OTN, MySQL, Fahd Mirza, FMTONLY, June 26 2015

This Log Buffer Edition gathers a wide sample of blogs and then purifies the best ones from Oracle, SQL Server and MySQL.

Oracle:

If you take a look at the “alter user” command in the old 9i documentation, you’ll see this: DEFAULT ROLE Clause.
There’s been an interesting recent discussion on the OTN Database forum regarding “Index blank blocks after a large update that was rolled back.”
12c Parallel Execution New Features: 1 SLAVE distribution
Index Tree Dumps in Oracle 12c …

[Read more]

Aug

2014

Resources for Database Clusters: Performance Tuning for HAProxy, Support for MariaDB 10, Technical Blogs & More

Posted by Severalnines on Thu 28 Aug 2014 07:28 UTC
Tags:

Tools, Other, ha, Nginx, High Availability, webinar, ETL, analytics, hadoop, performance tuning, big data, mariadb, mongodb, haproxy, MySQL, clustercontrol

Check Out Our Latest Resources for MySQL, MariaDB & MongoDB Clusters

Here is a summary of resources & tools that we’ve made available to you in the past weeks. If you have any questions on these, feel free to contact us!

New Technical Webinars

Performance Tuning of HAProxy for Database Load Balancing

09 September 2014 - with Baptiste Assmann of HAProxy Technologies

Do you know what HAProxy can tell you about your application and database instances? Do you know the difference between …

[Read more]

Jun

2014

Big Data Integration & ETL - Moving Live Clickstream Data from MongoDB to Hadoop for Analytics

Posted by Severalnines on Mon 16 Jun 2014 08:15 UTC
Tags:

Other, Data Integration, ETL, Migration, analytics, hadoop, talend, data migration, big data, mongodb, MySQL, hdfs, tokumx, clickstream

MongoDB is great at storing clickstream data, but using it to analyze millions of documents can be challenging. Hadoop provides a way of processing and analyzing data at large scale. Since it is a parallel system, workloads can be split on multiple nodes and computations on large datasets can be done in relatively short timeframes. MongoDB data can be moved into Hadoop using ETL tools like Talend or Pentaho Data Integration (Kettle).

In this blog, we’ll show you how to integrate your MongoDB and Hadoop datastores using Talend. We have a MongoDB database collecting clickstream data from several websites. We’ll create a job in Talend to extract the documents from MongoDB, transform and then load them into HDFS. We will also show you how to schedule this job to be executed every 5 minutes.

Test Case

We have an application …

[Read more]

Jan

2014

MariaDB CONNECT Storage Engine as an ETL (or ELT) ?

Posted by Serge Frezefond on Tue 28 Jan 2014 15:25 UTC
Tags:

Oracle, ETL, mariadb, MySQL

The MariaDB CONNECT Storage Engine allows access to heterogeneous data sources. In my previous post I showed you how to use the MariaDB CONNECT Storage Engine to access an Oracle database. This is quite easy through the CONNECT Storage Engine ODBC table type.

For most architectures where heterogeneous databases are involved an ETL (Extract-Transform-Load) is [...]
