Earlier this month I blogged about our new Hadoop applier, and this week I published the docs for it (http://docs.continuent.com/tungsten-replicator-3.0/deployment-hadoop.html) as part of the Tungsten Replicator 3.0 documentation (http://docs.continuent.com/tungsten-replicator-3.0/index.html). It contains some additional interesting nuggets that will appear in future blog posts.
How do you get data from a MySQL Cluster into Hadoop? Easy: replicate from the cluster to a standalone MySQL instance, and from there use the MySQL Hadoop Applier to push the data into HDFS.
This question came from a long-time MySQL user who has jumped into the Big Data world.
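A minimal sketch of that chain, with placeholder host names and credentials (none of these values come from the original post):

```bash
# 1. Make the standalone MySQL server a replica of one of the cluster's SQL
#    nodes (add MASTER_LOG_FILE/MASTER_LOG_POS for the real starting position).
mysql -h standalone-host -u root -p <<'SQL'
CHANGE MASTER TO
    MASTER_HOST='ndb-sql-node-1',
    MASTER_USER='repl',
    MASTER_PASSWORD='secret';
START SLAVE;
SQL

# 2. The MySQL Hadoop Applier reads this server's binary log and streams the
#    changes into HDFS, so enable log-bin (ROW format) and log-slave-updates
#    on the standalone instance before pointing the applier at it.
```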
Moving data between databases is hard. Without ever intending it, I seem to have spent a lifetime working on solutions for getting data into and out of databases, but more frequently between them. In fact, my first job out of university was migrating data from BRS/Text, a free-text database (probably what we would now call NoSQL), into a more structured Oracle database.
Today I spend some of my time working in Big Data, more often than not migrating information from existing data stores into Big Data platforms so that it can be analysed, something I covered in more detail here: [Read more...]
In Part 1 we started our study of Amazon Services and looked at Amazon EC2. In this part, we will look at other Amazon services like EMR, DynamoDB and RDS.
Amazon EMR (Elastic MapReduce) is a web service that makes it easy to run big data workloads in the cloud. An EMR cluster comes preconfigured with Hadoop, which, as mentioned earlier, is a data processing and storage framework. This preconfiguration means you can start analysing your data in no time. Amazon EMR has applications in machine learning, financial analysis, bioinformatics, and more.
Just like EC2, you can launch as many EMR instances as you need, and you will only be charged for the computing power you have used. [Read more...]
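As an illustration, launching and tearing down a small cluster with the current AWS CLI looks roughly like this; the cluster name, instance type and count, and release label are placeholders, and older EMR releases used --ami-version instead of --release-label:

```bash
# Spin up a three-node EMR cluster preconfigured with Hadoop and Hive.
aws emr create-cluster \
    --name "demo-cluster" \
    --release-label emr-4.0.0 \
    --applications Name=Hadoop Name=Hive \
    --instance-type m3.xlarge \
    --instance-count 3 \
    --use-default-roles

# Terminate the cluster once the analysis is done so the billing stops.
aws emr terminate-clusters --cluster-ids j-XXXXXXXXXXXX
```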
The third and final article in my series on migrating data to and from Hadoop and SQL databases is now available:
Big data is a term that has been used regularly now for almost a decade, and it, along with technologies like NoSQL, is seen as a replacement for the long-successful RDBMS solutions that use SQL. Today, DB2®, Oracle, Microsoft® SQL Server, MySQL, and PostgreSQL dominate the SQL space and still make up a considerable proportion of the overall market. In this final article of the series, we will look at more automated solutions for migrating data to and from Hadoop. In the previous articles, we concentrated on methods that take exports or otherwise formatted and extracted data from your SQL source, load that into Hadoop in some way, and then process or parse it. But if you want to analyze big data,
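Apache Sqoop is a typical example of the kind of automated transfer tool meant here; a rough sketch of moving a table in each direction, with made-up connection details and paths:

```bash
# Pull a MySQL table into HDFS using parallel map tasks.
sqoop import \
    --connect jdbc:mysql://dbhost/sales \
    --username analyst -P \
    --table orders \
    --target-dir /user/demo/orders \
    --num-mappers 4

# Push processed results back out of HDFS into a MySQL table.
sqoop export \
    --connect jdbc:mysql://dbhost/sales \
    --username analyst -P \
    --table order_summaries \
    --export-dir /user/demo/summaries
```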
The Economist says that physics suggests storms will get worse as the planet warms. Typhoon Haiyan in the Philippines, bushfires in Australia, floods in China, and extreme, unpredictable weather across the planet are a sober reminder. The good news is that technology and awareness are rising, and so is the data. Database technologies are playing their part, intelligently storing that data and enabling stakeholders to analyze it and draw meaningful results to predict and counter extreme conditions. This Log Buffer Edition appreciates these efforts.
Big Data Tools that You Need to Know About – Hadoop & NoSQL.[Read more...]
I have spent the better part of the last month at Big Data conferences, trying to peer past the $2.5 million in marketing smoke and see what is really going to show up on the to-do list of DBAs. The first bit of news is that half the vendors at shows like Strata or Big Data TechCon will probably be gone by this time next year, so picking a vendor right now is a little iffy. Hadoop's ecosystem is flourishing and will surely be around for some time, but the vendors are playing musical chairs.
But we are Open Source and we do not need vendors! Well, yes and no. The good folks at Cloudera and Hortonworks have done you a big favor by providing wonderful tutorials that are worth your time. Recently two former MySQL-ers, Sarah Sproehnle and Ian Wrigley, have put together[Read more...]
In the previous article we introduced Hadoop as the most popular Big Data toolset on the market today. We had just started talking about MapReduce as the major framework that makes Hadoop distinctive. So let’s continue the discussion where we left off.
MapReduce is really the key to understanding Hadoop's parallel processing: it enables data in various formats (XML, text, binary, log, SQL, etc.) to be split up, mapped out to many compute nodes, and then recombined to produce a final data set.
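As a simplified illustration, here is a word count expressed as a Hadoop Streaming job built from plain Unix tools; the input and output paths and the streaming jar location are placeholders and vary between distributions:

```bash
# The mapper splits each line into one word per line; the shuffle/sort phase
# groups identical words together across nodes; the reducer counts each run.
hadoop jar /usr/lib/hadoop-mapreduce/hadoop-streaming.jar \
    -input  /user/demo/logs \
    -output /user/demo/wordcount \
    -mapper 'tr -s " " "\n"' \
    -reducer 'uniq -c'
```

The same map, shuffle, and reduce phases apply whether the mapper and reducer are shell one-liners, Java classes, or scripts in another language.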
In Part 3 of this series we found out that Big Data is a huge revenue generator for businesses, expected to drive $232 billion in spending through 2016. In this installment we'll continue to explore why Big Data must become a critical part of any business strategy in the 21st century.
First, the Bad News
Okay, so we get the notion of what Big Data is and why it's important for business. So what can be done about it? The first step is to recognize your business strategy and needs, as well as the limitations of your current infrastructure. Traditional organizational data warehouses are based on structured, well-organized data sets. Think Oracle,[Read more...]
You love MySQL for its ease of deployment – but are you worried about how your application will perform when it starts to scale?
SPEAKER: Gerry Narvaja, Tokutek
DATE: Wednesday, November 6th
TIME: 1pm ET
Join this interactive webinar with Gerry Narvaja of Tokutek as he walks through the potential pitfalls of using MySQL for Big Data applications, explains how you can avoid unnecessary tolls on time and resources, and shares tips on getting the most out of your MySQL applications with open source TokuDB.
Attend this webinar to learn how to:
The second article in a series covering Big Data and SQL interaction is available now:
“Big data” is a term that has been used regularly now for almost a decade, and it, along with technologies like NoSQL, is seen as a replacement for the long-successful RDBMS solutions that use SQL. Today, DB2®, Oracle, Microsoft® SQL Server, MySQL, and PostgreSQL dominate the SQL space and still make up a considerable proportion of the overall market. Here in Part 2, we will concentrate on how to use HBase and Hive for exchanging data with your SQL data stores. From the outside the two systems seem largely similar, but they have very different goals and aims. Let's start by looking at how the two systems differ and how we can take advantage of that in our big data requirements.[Read more...]
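To give a flavor of the Hive side of that exchange, here is a minimal sketch that exposes a CSV export from a relational database as a queryable table; the table name, columns, and HDFS path are invented for illustration:

```bash
# Define an external Hive table over files already sitting in HDFS, then run
# an ordinary SQL-style aggregate over them.
hive -e "
CREATE EXTERNAL TABLE customers (
    id INT,
    name STRING,
    region STRING
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
LOCATION '/user/demo/customers';

SELECT region, COUNT(*) FROM customers GROUP BY region;
"
```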
Ask ten DBAs for a definition of ‘Big Data’ and you will get more than ten replies. And the majority of those replies will lead you to Hadoop. Hadoop has been the most prominent of the big data frameworks in the open source world. Over 80% of the Hadoop instances in the world are fed their data from MySQL¹. But Hadoop is made up of many parts, some confusing and many that do not play nicely with each other. It is analogous to being given a pile of automotive parts from different models and trying to come up with a car at the end of the day. So what do you do if you want to copy some of your relational data into Hadoop and want to avoid the equivalent of scraped knuckles? The answer is Bigtop, and what follows is a way to get a one-node, does-it-all system running so you can experiment with Hadoop, Map/Reduce, Hive, and all [Read more...]
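For what it is worth, a single-node Bigtop setup on a Debian or Ubuntu box looks roughly like the following; the repository configuration is omitted and package and service names can differ between Bigtop releases, so treat it as a sketch rather than a recipe:

```bash
# Install the pseudo-distributed Hadoop configuration plus Hive from the
# Bigtop package repository.
sudo apt-get install hadoop-conf-pseudo hive

# Format HDFS once, then start the daemons for a "cluster of one".
sudo -u hdfs hdfs namenode -format
for svc in hadoop-hdfs-namenode hadoop-hdfs-datanode \
           hadoop-yarn-resourcemanager hadoop-yarn-nodemanager; do
    sudo service "$svc" start
done
```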
I will be talking about Big Data with MySQL and Hadoop at MySQL Connect 2013 (Sept. 21-22) in San Francisco as well as at Percona University in Washington, DC (September 12, 2013). Apache Hadoop is a very popular Big Data solution and we can nowadays easily integrate it with MySQL. I will start with a brief introduction of Apache Hadoop and its components (HDFS, Map/Reduce, Hive, HBase/HCatalog, Flume, Sqoop, etc.). Next I will show 2 major Big Data scenarios:
Those who deal with big data probably know about Disco, a distributed computing framework aimed at providing a MapReduce platform for big data processing in Python applications. We are proud to say that we are one of the largest users of Disco in the Netherlands. As the owner of multiple high-traffic portals with lots of […]
Read the original article at The Needle in Big Data Noise
Join 5,500 others and follow Sean Hull on Twitter @hullsean. Also take a look at: I hacked Disqus Digests to discover new blogs. Who the heck is Bayes? Thomas Bayes was a scientist & thinker, Fellow of the Royal Society, and back in 1763 author of “An Essay toward Solving a Problem in the Doctrine [...]
For more articles like these, go to Sean Hull's Scalable Startups.
With this version, the source code is now freely available under the GPL v2 license. For more details, see our blog here. Open source pioneer Mozilla has been using TokuDB to manage its MySQL-driven Datazilla data cluster, an open-source system for managing and visualizing performance data.
Date: May 2nd
Time: 2 PM EST / 11 AM PST
In the past TokuDB has been free for evaluation; the new TokuDB Community Edition extends free use to deployed environments. With this release Tokutek is also planning to make available a TokuDB Enterprise Edition, which includes technical support,[Read more...]
We want to thank everyone for naming Tokutek the Corporate Contributor of the Year 2013 for our ongoing contributions to the MySQL community.
The MySQL Community Awards are given annually to the people and companies that support the MySQL ecosystem. The MySQL Community Award for Corporate Contributor of the Year recognizes a company or other organization or entity that has made valuable contributions to the MySQL ecosystem either in terms of open source code, knowledge,[Read more...]