Planet MySQL

Displaying posts with tag: metadata (reset)

Aug

2018

Databook: Turning Big Data into Knowledge with Metadata at Uber

Posted by Uber Engineering on Fri 03 Aug 2018 15:30 UTC
Tags:

postgres, Infrastructure, metadata, Architecture, data warehouse, Data Management, vertica, cassandra, quartz, Hive, MySQL, hdfs, Kafka, gradle, Uber, Uber Data, Data Storage, Databook, Dropwizard, Queryparser, RESTful API, Uber Data Knowledge, Uber Engineering

From driver and rider locations and destinations, to restaurant orders and payment transactions, every interaction on Uber’s transportation platform is driven by data. Data powers Uber’s global marketplace, enabling more reliable and seamless user experiences across our products for riders, …

The post Databook: Turning Big Data into Knowledge with Metadata at Uber appeared first on Uber Engineering Blog.

May

2016

How to Deal with MetaData Lock

Posted by The Pythian Group on Thu 05 May 2016 13:59 UTC
Tags:

metadata, MySQL, Technical Track

What is MetaData Lock?

MySQL uses metadata locking to manage concurrent access to database objects, and to ensure data consistency when performing modifications to the schema: DDL operations. Metadata locking applies not just to tables, but also to schemas and stored programs (procedures, functions, triggers, and scheduled events).

In this post I am going to cover metadata locks on tables and triggers, that are usually seen by DBAs during regular operations/maintenance.

Kindly refer to these 4 different connections to MySQL Instance:

The screenshot shows that the uncommitted transaction may cause metadata lock to ALTER operations. The ALTER will not proceed until the transaction is committed or rolled-back. What is worse, after the ALTER is issued, any queries to that table (even simple SELECT queries) will be blocked. If the ALTER operation is an …

[Read more]

Feb

2014

Tracking Metadata Locks (MDL) in MariaDB 10.0

Posted by Chris Calender on Mon 03 Feb 2014 23:42 UTC
Tags:

metadata, performance_schema, skysql, MySQL, metadata locking, mdl, metadata lock, MDL locking, MDL bottleneck, metadata Lock bottleneck, metadata_locks, myisam metadata lock, mysql metadata lock, table_handles, tracking mdl lock, Tracking MDL locks in MySQL 5.7, tracking metadata lock, Tracking Metadata Locks in MySQL 5.7, who is holding the metadata lock, mariadb metadata locks

I recently blogged about tracking metadata locks in the latest MySQL, and now I want to discuss how to track these metadata locks in MariaDB.

In MySQL 5.7, there is a table named `metadata_locks` added to the performance_schema (performance_schema must be enabled *and* the metadata_locks instrument must be specifically enabled as well.

In the MariaDB 10.0 implementation (as of 10.0.7), there is a table named METADATA_LOCK_INFO added to the *information_schema*. This is a new plugin, so the plugin must be installed, but that is very simple with:

INSTALL SONAME 'metadata_lock_info';

Then, you will have the table.

To see it in action:

Connection #1:

mysql> create table t (id int) engine=myisam;
mysql> begin;
mysql> select * from t;

Connection #2:

mysql> alter table t add index …

[Read more]

Feb

2014

Tracking Metadata Locks (MDL) in MySQL 5.7

Posted by Chris Calender on Sat 01 Feb 2014 00:41 UTC
Tags:

I’ve blogged about metadata locks (MDL) in the past (1 2 3) and in particular discussed how best to track them down and troubleshoot threads stuck waiting on metadata locks.

If you’ve had any experience with these, you’ll know finding them isn’t always the most straight-forward task.

So I was glad to see metadata lock instrumentation added to MySQL 5.7.3 as part of performance_schema, which makes tracking these down a breeze! (Note this is only in 5.7.3 currently, and therefore is some time from being GA as of today)!

To use these, performance_schema must be enabled (i.e., performance_schema=1 in your config file).

But, also, the metadata_locks instrument is disabled by default, so even if you enable the …

[Read more]

Nov

2011

Data Modeling

Posted by Matt Casters on Thu 03 Nov 2011 14:07 UTC
Tags:

Open Source, Data Integration, metadata, Kettle, PDI, Kimball, Multidimensional modeling

Dear data integration fans,

I’m a big fan of “appropriate” data modeling prior to doing any data integration work. For a number of folks out there that means the creation of an Enterprise Data Warehouse model in classical Bill Inmon style. Others prefer to use modern modeling techniques like Data Vault, created by Dan Linstedt. However, the largest group data warehouse architects use a technique called dimensional modeling championed by Ralph Kimball.

Using a modeling technique is very important since it brings structure to your data warehouse. …

[Read more]

Aug

2011

Viewing RMAN jobs status and output

Posted by Ben Mildren on Fri 26 Aug 2011 05:10 UTC
Tags:

Oracle, Backup, Group Blog Posts, metadata, views, query, restore, Technical Blog, rman, gv$rman_output, output, v$, V$BACKUP_SET, V$BACKUP_SET_DETAILS, V$RMAN_BACKUP_JOB_DETAILS, v$rman_output

Yesterday I was discussing with a fellow DBA about ways to check the status of existing and/or past RMAN jobs. Good backup scripts usually write their output to some sort of log file so, checking the output is usually a straight-forward task. However, backup jobs can be scheduled in many different ways (crontab, Grid Control, Scheduled Tasks, etc) and finding the log file may be tricky if you don’t know the environment well.
Furthermore, log files may also have already been overwritten by the next backup or simply just deleted. An alternative way of accessing that information, thus, may come handy.

Fortunately, RMAN keeps the backup metadata around for some time and it can be accessed through the database’s V$ views. Obviously, if you need this information because your database just crashed and needs to be restored, the method described here is useless.

Backup jobs’ status and metadata

A lot of metadata about …

[Read more]

May

2011

Dynamic de-normalization of attributes stored in key-value pair tables

Posted by Matt Casters on Mon 23 May 2011 14:06 UTC
Tags:

Data Integration, metadata, ETL, Normalization, de-normalization, pentaho data integration, injection, key value pairs

Dear Kettlers,

A couple of years ago I wrote a post about key/value tables and how they can ruin the day of any honest person that wants to create BI solutions. The obvious advice I gave back then was to not use those tables in the first place if you’re serious about a BI solution. And if you have to, do some denormalization.

However, there are occasions where you need to query a source system and get some report going on them. Let’s take a look at an example :

mysql> select * from person;
+----+-------+----------+
| id | name  | lastname |
+----+-------+----------+
|  1 | Lex   | Luthor   |
|  2 | Clark | Kent     |
|  3 | Lois  | Lane     |
+----+-------+----------+
3 rows in set (0.00 sec)

mysql> select * from person_attribute;
+----+-----------+---------------+------------+
| id | person_id | attr_key      | attr_value | …

[Read more]

Feb

2011

Parse nasty XLS with dynamic ETL

Posted by Matt Casters on Fri 25 Feb 2011 12:07 UTC
Tags:

Data Integration, metadata, ETL, Kettle, Excel, pentaho data integration, injection, Spreadsheet

Dear Kettle friends,

Last year, right after the summer in version 4.1 of Pentaho Data Integration, we introduced the notion of dynamically inserted ETL metadata (Youtube video here). Since then we received a lot of positive feedback on this functionality which encouraged me to extend it to a few more steps. Already with support for “CSV Input” and “Select Values” we could do a lot of dynamic things. However, we can clearly do a lot better by extending our initiative to a few more steps: “Microsoft Excel Input” (which can also read ODS by the way), “Row Normalizer” and “Row De-normalizer”.

Below I’ll describe an actual (obfuscated) example that you will probably recognize as it is equally hideous as simple in it’s horrible complexity.

Take a look at this file:

Let’s assume that this spreadsheet …

[Read more]

Feb

2011

Drizzle metadata tables

Posted by Stewart Smith on Tue 08 Feb 2011 00:03 UTC
Tags:

postgresql, metadata, drizzle, information_schema, data_dictionary, MySQL

Giuseppe has a great post about the Evolution of MySQL metadata, and I thought I’d have a look at what we have in Drizzle. It’s pretty easy to work out how many tables are in each schema, we just query the standard INFORMATION_SCHEMA.TABLES view:

drizzle> select table_schema,count(table_name)
    ->  from information_schema.tables
    -> group by table_schema;
+--------------------+-------------------+
| table_schema       | count(table_name) |
+--------------------+-------------------+
| DATA_DICTIONARY    |                53 |
| INFORMATION_SCHEMA |                20 |
+--------------------+-------------------+
2 rows in set (0 sec)

In Drizzle it’s important to note that there is a differentiation between SQL …

[Read more]

Feb

2011

Evolution of MySQL metadata

Posted by Giuseppe Maxia on Mon 07 Feb 2011 18:00 UTC
Tags:

metadata, information_schema, performance_schema, binaries, MySQL

I was looking at the latest MySQL versions, and I happened to notice that there has been a great increment in the number of metadata tables, both in the information_schema and performance_schema databases. So I made a simple count of both schemas in the various versions, and draw a graph. The advance looks straightforward.

…

version	Information_schema	performance_schema
5.0.92	17	0
5.1.54	28	0
5.1.54 with innodb plugin	35	0
5.5.8	37	17
5.6.2	48

[Read more]

Top Authors

Oracle MySQL Blogs

Vendor Blogs

MySQL Links