Showing entries 1 to 8
Displaying posts with tag: solr (reset)
Which tech do startups use most?

Leo Polovets of Susa Ventures publishes an excellent blog called Coding VC. There you can find some excellent posts, such as pitches by analogy, and an algorithm for seed round valuations and analyzing product hunt data. He recently wrote a blog post about a topic near and dear to my heart, Which Technologies do Startups […]

A book every php developer should read

Once upon a time, a long long time ago, when there was no Solr and lucene used to be a search engine api available for php developers to use, they used to struggle for using lucene. Most people reverted back to Mysql full text search. And some ventured into using sphinx - another free full text search engine. Then came Solr and php developers were thrilled with the ease of use over the http

The SMAQ stack for big data

SMAQ report sections

→ MapReduce

→ Storage

→ Query

→ Conclusion

"Big data" is data that becomes large enough that it cannot be processed using conventional methods. Creators of web search engines were among the first to confront this problem. Today, social networks, mobile phones, sensors and science contribute to petabytes of data created daily.

To meet the challenge of processing such large data sets, Google created MapReduce. Google's work and Yahoo's creation of the Hadoop MapReduce implementation has spawned an ecosystem of big data processing tools.

As MapReduce has grown in popularity, a stack for big data systems …

[Read more]
Disrupting IT with Open Source & Cloud

A couple of weeks ago I gave a presentation at the Apache Lucene Eurocon in Prague. It was a good conference focused on Lucene/Solr open source search technology and sponsored by Lucid Imagination.  

I've posted the bulk of the presentation below.  (I omitted a couple of slides that were MySQL specific.) Even though it was a technical conference, I got positive feedback from the attendees and organizers that the information was useful in helping folks think about where to focus their efforts.  

The slides have been posted to Box.net and are shown using their new "embedded preview" feature which is pretty cool. …

[Read more]
451 CAOS Links 2010.01.08

Google unveils the Nexus One. RMS explains his position on dual licensing. And more.

Follow 451 CAOS Links live @caostheory on Twitter and Identi.ca
“Tracking the open source news wires, so you don’t have to.”

For the latest on Oracle’s acquisition of MySQL via Sun, see Everything you always wanted to know about MySQL but were afraid to ask

# Google launched the Nexus One Android phone.

# Richard Stallman explained his position on selling exceptions to the GNU GPL.

# Novell’s chief technology and …

[Read more]
Comparison Between Solr And Sphinx Search Servers (Solr Vs Sphinx – Fight!)

In the past few weeks I've been implementing advanced search at Plaxo, working quite closely with Solr enterprise search server. Today, I saw this relatively detailed comparison between Solr and its main competitor Sphinx (full credit goes to StackOverflow user mausch who had been using Solr for the past 2 years). For those still confused, Solr and Sphinx are similar to MySQL FULLTEXT search, or for those even more confused, think Google (yeah, this is a bit of a stretch, I know).

Similarities

  • Both Solr and Sphinx satisfy all of your requirements. They're fast and designed to index and search large bodies of data efficiently.
  • Both have a long list of high-traffic sites …
[Read more]
451 CAOS Links 2009.06.30

Governments. Governance. Customers wins. And more.

Follow 451 CAOS Links live @caostheory
“Tracking the open source news wires, so you don’t have to.”

Governments
The Examiner provided a two part interview with Daniel Risascher, Office of the CIO, Department of Defense, on open source at the DoD, while Government Technology Magazine reported on how open source software and cloud computing can save government money. Similarly, The UK Conservative party delivered a paper on the future of open standards, open source, SOA and cloud for UK Government, while it was reported that Vienna to …

[Read more]
django - signals

continuing the recent thread about contenttypes in django I thought I would talk about a feature which got added in the magic removal branch, which doesn’t have as much attention as I think it deserves.

signals and the dispatcher.

signals are way of telling the rest of the world that something happened. If you are interested you simply listen for it (connect in django speak).

take for example my tagging application currently in use on zyons. one of it’s features is that it let’s users store their own tags.

One of the performance improvements I added to this was the creation of a ‘summary’ tag which aggregates which the …

[Read more]
Showing entries 1 to 8