Showing entries 1 to 10 of 11
1 Older Entries »
Displaying posts with tag: Language (reset)
Random human recognizable dataset

We all do need sometimes to generate raw valid dummy data for our use cases and applications as we start them. Obviously, one can write their own scripts to generate random data, but it is much better to have data, to which human beings can associate with like names, addresses instead of having them filled with random "lorem ipsum" string data :)

While searching for such a tool, I found a site which does exactly this: http://www.generatedata.com/

Documentation: http://benkeen.github.io/generatedata/

This can also be downloaded and installed locally. It supports three types of installations:
- A single, anonymous user account
- A single user account, requires login
- Multiple accounts

Below is the set of wide varied data types it supports for …

[Read more]
Really large NLP corpora

Jeeze people. You’re all noisy. I’m sure it was all done for posterity’s sake.

23M     irclogs/MagNET/#perl.log
29M     irclogs/freenode/#mysql.log
36M     irclogs/freenode/#debian.log
37M     irclogs/foonetic/#xkcd.log
39M     irclogs/OFTC/#debian.log
43M     irclogs/freenode/#jquery.log
44M     irclogs/freenode/#perl.log

$ for file in irclogs/MagNET/#perl.log irclogs/freenode/#mysql.log irclogs/freenode/#debian.log irclogs/foonetic/#xkcd.log irclogs/OFTC/#debian.log irclogs/freenode/#jquery.log irclogs/freenode/#perl.log; do echo -n "$file: " ; head -1 $file ; done
irclogs/MagNET/#perl.log: --- Log opened Thu May 26 08:31:32 2011
irclogs/freenode/#mysql.log: --- Log opened Wed Dec 28 09:03:49 2011
irclogs/freenode/#debian.log: --- Log opened Tue Mar 12 12:52:40 2013
irclogs/foonetic/#xkcd.log: --- Log opened Wed Dec 28 19:33:43 2011
irclogs/OFTC/#debian.log: --- Log opened Tue Jul 12 19:25:48 2011
irclogs/freenode/#jquery.log: --- Log opened Tue Jan 31 …
[Read more]
Four short links: 11 January 2010
  1. mytop -- a MySQL top implementation to show you why your server is so damn slow right now.
  2. What Could Kill Elegant High-Value Participatory Project? -- The problem was not that the system was buggy or hard to use, but that it disrupted staff expectations and behavior. It introduced new challenges for staff [...]. Rather than adapt to these challenges, they removed the system. [...] No librarian would get rid of all the Harry Potter books because they are "too popular." No museum would stop offering an educational program that was "too successful." These are familiar challenges that come with the job and are seen to have benefit. But if tagging creates a line or people spend too much time giving you feedback? Staff at …
[Read more]
Four short links: 26 October 2009
  1. Toiling in the Data Mines -- Tom Armitage describes the process that Berg calls "material exploration". Programmers very rarely talk about what their work feels like to do, and that's a shame. Material explorations are something I've really only done since I've joined BERG, and both times have felt very similar - in that they were very, very different to writing production code for an understood product. They demand code to be used as a sculpting tool, rather than as an engineering material, and I wanted to explain the knock-on effects of that: not just in terms of what I do, and the kind of code that's appropriate for that, but also in terms of how I feel as I work on these explorations. Even if the section on the code itself feels foreign, I hope that the explanation of what it …
[Read more]
Why a world religion? And in 20 languages?

In a blog entry I’ve explained why I consider running a religion and why I want to spread this religion with social media. Still, what’s the point with doing it worldwide? Isn’t enough to do it close to my own front door, either at home in Munich, Germany or in my native Finland?

The reason relates to my interest for other countries, cultures and languages. Through my work, I’ve travelled a lot especially last year, as Ambassador of the product of my former employer. As I …

[Read more]
Hidden Features Of Perl, PHP, Javascript, C, C++, C#, Java, Ruby, Python, And Others [Collection Of Incredibly Useful Lists]

Introduction

StackOverflow is an amazing site for coding questions. It was created by Joel Spolsky of joelonsoftware.com, Jeff Atwood of codinghorror.com, and some other incredibly smart guys who truly care about user experience. I have been a total fan of SO since it went mainstream and it's now a borderline addiction (you can see my StackOverflow badge on the right sidebar).

The Story

Update 6/21/09: This server is currently under very heavy load (10-200), even with caching plugins enabled. Please bear with me as I try to resolve the situation.

Feel free to …

[Read more]
Career trap: Internet

Do you read German? Then I have two recommendations for you.

First, go read this fresh interview with the German social networking guru Klaus Eck. It’s about “a life long job application process”.

Second, go read the book “Karrierefalle Internet” (”Career trap: Internet”) by Klaus Eck. That book is what the interview is all about.

Klaus Eck’s basic statement is “go manage your online reputation before others do it for you“. He notes that he’s seen plenty of Angst amongst …

[Read more]
Argentina Launch of MySQL: Customers, Meetings, Press


Señoras y señores, I am happy to have been part of the MySQL launch in Argentina yesterday. Visiting Argentina has been a great opportunity to meet with the MySQL users and not-yet-users in a country with 30 degrees Celsius, with colourful houses in La Boca, with an omnipresent Diego Maradona, and only minor challenges in the form of payment methods when using local transport.

On Monday evening, I visited the Universidad Nacional De La Matanza. Together …

[Read more]
Now I’m blogging in Russian, too!

To understand a bit of Italian, I just need a comparatively small amount of vino bianco. By contrast, to get any information flow going at all in Russian requires larger amounts of … preparation. That doesn’t have to be vodka, it can also be interesting discussions with Russians, or the opportunity to give a speech.

Now, a blog is the scalable way to interact with the rest of humanity, and I’m trying to increase my fluency in all things Web 2.0. So, here goes, may I present my Russian blog:

Like in the case of presenting my Italian blog, let me quote Google Translate’s automatic translation of some of my “writings” — deliberately doing so without making any improvements on the automatic translation:

[Read more]
I’m blogging in Italian!

Given that I don’t speak Italian, it may seem a bit strange that I just started an Italian language blog on http://blogs.arno.fi/dolce_vita/:

But I do have a point with my blog. Let me quote Google Translate’s automatic translation of some of my “writings” — deliberately doing so without making any improvements on the automatic translation:

Why this blog?

“Of all the languages that I do not speak, I speak Italian the best.”

This is my motto when it comes to speaking Italian. Moreover, Italy is my favorite …

[Read more]
Showing entries 1 to 10 of 11
1 Older Entries »