Showing entries 37411 to 37420 of 44082
« 10 Newer Entries | 10 Older Entries »
Predicting how long data load would take

I had this question asked many times during last week, and there is really no easy answer. There are just way too many variables to consider especially while loading large schemas with a lot of different table structures. So which variables affect the load speed:

Table Structure This one is tricky. The shorter rows you have the faster load will normally be, if you measure rows, but if you measure in amount of loaded data longer rows are faster.

Indexes The more indexes you have the longer load would take. Details depend on storage engines, for example MyISAM can build non-unique indexes by sort which is very fast and Innodb can use insert buffer for them.

Data Insert Order It is well known it is important to load data in primary key order into Innodb tables for optimal performance, but effect does not end here. If indexes are built by inserts in Btree order in which …

[Read more]
Open Source Business out of Spain – Case Warp

I will now and then blog in English as sometimes there is a need for people to comment in English. To get things started I asked several questions from Ignacio Correas of Warp , one of the leading Open Source companies out of Spain.

1. How did you get started?

The idea of founding a consultancy company specialized on open source technologies came from six guys which were probable the best Linux experts and most active open source advocates in the region. A year before, I had left a good job in Finland to try to start the same kind of company in Zaragoza, so when they asked me to join in as the non-geek member of the team I could not resist.

Before founding Warp we applied to become a part of CEEI, which is a incubator for innovative companies funded by the regional government and which gives some grants and support for its members. On the other hand, …

[Read more]
MySQL Optimizer Bug 28554

When we tried to clean up a rather large (4.500.000 rows, 20GB) InnoDB table some days ago, we were astonished by the time MySQL took to complete the task. We had already LIMITed the transaction size, but every single chunk still took minutes to execute. The table itself contains some number columns, including a numeric primary key, and a blob. The delete condition was mainly based on the primary key (being smaller than a predefined value) and status field. After some mails between the support crew and us an optimizer bug was identified: MySQL Bug #28554.

The problem is that in some cases the optimizer makes a bad choice concerning which index to use. It will pick a secondary index that can be used to cover a WHERE indexed_column=<constant> condition, even though it will cause way more data to be scanned than necessary. The primary key for …

[Read more]
MySQL Optimizer Bug 28554

When we tried to clean up a rather large (4.500.000 rows, 20GB) InnoDB table some days ago, we were astonished by the time MySQL took to complete the task. We had already LIMITed the transaction size, but every single chunk still took minutes to execute. The table itself contains some number columns, including a numeric primary key, and a blob. The delete condition was mainly based on the primary key (being smaller than a predefined value) and status field. After some mails between the support crew and us an optimizer bug was identified: MySQL Bug #28554.

The problem is that in some cases the optimizer makes a bad choice concerning which index to use. It will pick a secondary index that can be used to cover a WHERE indexed_column=<constant> condition, even though it will cause way more data to be scanned than necessary. The primary key for …

[Read more]
Innodb Recovery Update - The tricks what failed.

As I wrote we had been recovering corrupted Innodb Tablespace and it is finally done now.

As this was over than 1TB worth of data we really tried to avoid dumping the data and find some other way to recovery.
Examining Innodb page content and crash information we figured out it should be page of insert buffer itself, rather than page belonging to some of the pages touched by insert buffer.

If that would be just the table it should have been easy - dropping the table with corrupted page would remove all appropriate insert buffer records and be able just to load that table back. Unfortunately we were not that lucky.

If the page corrupted in insert buffer itself there seems to be no way to make Innodb avoid touching it. Even when we dropped all Innodb tables (for test purposes) Innodb still was crashing if we …

[Read more]
Trying to find time to move into my apartment ..

.. is not so easy. I have been traveling for 4 weeks in USA, then this weekend I was at a wedding back in Germany. This weekend I will be trying out for the German national mixed frisbee team at the BUM (I rate my chances of making it pretty low - injured finger, never found a top level team I wanted to play in, which means that all the top level players do not know me that well .. and unfortunately I am also simply not that uber-good .. but hey I can try).

The following weekend I was also planning on going to play frisbee at the HUT in Halle, but since I will be in Norway the following weekend and the Netherlands the weekend after that, I guess I should really skip Halle. So I will try to go out and buy a bed, mattress, table and a cupboard or …

[Read more]
OOP<->RDBMS

It seems the slides that generated the most interested in my recent "Database meets OOP" talk were the ones on how to model inheritance inside your RDBMS. I wanted to mention two (and a half) additional approaches.

The first one I have no real experience with and is native inheritance support that can be found in Object-Relational DBMS (not to be confused with Object-Oriented DBMS, which are not relational). PostrgreSQL is a shining example of this, so they have native support for inheritance. Actually the PostgreSQL docs mention that since SQL99 inheritance is part of the spec, not sure if anyone has implemented it though. From my reading it looks like its just a convenience feature to spare you from having to writing joins (when fetching data from a child …

[Read more]
my first link from a .gov site

It's pretty cool to see link to your blog posted on a .gov domain :)

frankmash.blogspot.com is another one of my blogs where I post sometimes.

SAP Gets Serious About Web 2.0

SAP Chairman and co-founder Hasso Plattner has been out discussing how SAP is redefining itself and investing in building the next generation of Web 2.0 software for the Enterprise.  SAP has been out doing a few acquisitions in recent months, presumably to accelerate the pace of change.  While SAP has long been a quiet supporter of open source technology and investor in companies like MySQL and Zend,

Plattner is down right enthusiastic about opening up processes to have more community involvement and a more accessible hosted approach.  Plattner cited Google and Salesforce.com as a good models of how to rapidly deliver high quality information to users.  SAP is investing heavily in a new initiative known as A1S, which is a new online hosted offering based on a collaborative service-oriented architecture (SOA).

  • CNet:
[Read more]
Entry the First

Well there hasn't been any actual coding work done yet, this is just the project setup phase. This blog is connected to the Google Summer of Code 2007 Project "MySQL Auditing Software", the project page can be found at this link.

The project team is composed of myself, Umair Mehmood and the project mentor, Sheeri Kritzer. We'll be holding a conference call this Friday on Skype in order to exchange notes and design ideas.

I already have a fair idea of how this software can be designed, based on discussions with the project team members and research done into similar products I believe that a design based on passive packet capture is a good way to go. The resulting captured data can then be interpreted and reformed in whatever way we see fit before being passed on to the database servers.

Showing entries 37411 to 37420 of 44082
« 10 Newer Entries | 10 Older Entries »