Big Data search engine for full-text strings and photos with radius search
Nice Scenery
The find() function also recursively digs through subdirectories. For the search engine to store the geodata in a way that optimizes query performance, I need to add a mappings directive: The create() command starting in line 15 defines a geo_point property by the name of Location for the photo document type used in the photos index. The documentation for this [8] is out of date, by the way; the mapping it describes no longer works. I have, however, successfully tested Listing 4 with Elasticsearch release 1.0.0 RC2.
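As a rough illustration of what such a mappings directive might look like, the following sketch builds the request body as a plain Perl data structure and prints it as JSON. The index, type, and property names (photos, photo, Location) come from the text above; the helper name mapping_body() is made up for this example, and the exact layout used in Listing 4 may differ. The sketch assumes the typed mapping format of the Elasticsearch 1.x era and does not contact a server.

```perl
use strict;
use warnings;
use JSON::PP;

# Hypothetical helper: build a body that declares one geo_point
# property for one document type (Elasticsearch 1.x style mapping).
sub mapping_body {
    my ($type, $field) = @_;
    return {
        mappings => {
            $type => {
                properties => {
                    $field => { type => "geo_point" },
                },
            },
        },
    };
}

# Names taken from the article: document type "photo", property "Location".
my $body = mapping_body("photo", "Location");

# Print the structure as JSON, with sorted keys for stable output.
print JSON::PP->new->canonical->pretty->encode($body);
```

With the Elasticsearch Perl client, a body like this would presumably be handed to the create() call along with the index name photos; that wiring is an assumption here, not a copy of Listing 4.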
Starting with the JPEG images found by the search, line 32 in Listing 4 uses the IPhonePicGeo module to extract the geodata and pushes it, along with the file names, into the Elasticsearch database in the body section of the index() method starting in line 35.
After the data of all the photos has been indexed in this way, the script in Listing 5 retrieves all the snapshots that I took within 1km of the reference photo passed in at the command line. For this purpose, it ascertains the geodetic information of the reference image and then sends a match_all query, which on its own would return all stored images. Line 23 turns on a geo_distance filter that limits the hits to a radius of 1km. Additionally, the size parameter increases the maximum number of hits from the default of 10 to 100.
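One way to picture the search request described here is the sketch below, which assembles a match_all query wrapped in a geo_distance filter and caps the result size at 100. It is not Listing 5: the helper name radius_query() and the sample coordinates are invented for illustration, and the "filtered" query wrapper is an assumption that fits Elasticsearch 1.x (later major releases removed it). Only the Location property, the 1km distance, and the size of 100 come from the text.

```perl
use strict;
use warnings;
use JSON::PP;

# Hypothetical helper: match every document, then keep only those
# whose Location lies within $distance of the point ($lat, $lon).
sub radius_query {
    my ($lat, $lon, $distance, $size) = @_;
    return {
        size  => $size,
        query => {
            filtered => {
                query  => { match_all => {} },
                filter => {
                    geo_distance => {
                        distance => $distance,
                        Location => { lat => $lat, lon => $lon },
                    },
                },
            },
        },
    };
}

# Made-up reference coordinates; a real script would read them
# from the reference photo instead.
my $body = radius_query(37.8, -122.4, "1km", 100);

print JSON::PP->new->canonical->pretty->encode($body);
```

Keeping the body in a helper like this makes it easy to try other radii, such as "500m" or "5km", without touching the rest of the script.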
Listing 5: photo-gps-match
This returns a list of photo objects, from each of which line 37 extracts the original file name and pushes it to the end of the array @files. Finally, the system() function in line 40 calls eog (the Eye of GNOME application), which displays all the results as thumbnails (Figure 5). You can now click your way through them to explore the vicinity.
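The result handling just described could be sketched roughly as follows. The mock response stands in for what a search call would return, the helper name files_from_hits() is invented, and the stored field name "file" is an assumption, because the text does not say what the field is called. The nested hits layout with a _source entry per hit is the usual shape of an Elasticsearch search response.

```perl
use strict;
use warnings;

# Hypothetical helper: collect the stored file name from each hit.
sub files_from_hits {
    my ($results) = @_;
    my @files;
    for my $hit (@{ $results->{hits}{hits} }) {
        # "file" is an assumed field name, not taken from the listing.
        push @files, $hit->{_source}{file};
    }
    return @files;
}

# Mock response with two hits, for illustration only.
my $results = {
    hits => {
        hits => [
            { _source => { file => "IMG_0001.jpg" } },
            { _source => { file => "IMG_0002.jpg" } },
        ],
    },
};

my @files = files_from_hits($results);
my @cmd   = ("eog", @files);

# Show the command instead of running it, so the sketch works
# on machines without the viewer installed.
print "@cmd\n";

# system(@cmd) would launch the viewer. Passing a list of several
# arguments bypasses the shell, so file names with spaces are safe.
```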

No Limits
The geo function is just one of many plugin-like extensions of the Elasticsearch server, a useful tool that is easy to install and operate. It also scales almost without limit: As the volume of data increases, the administrator can distribute the indexes across a sufficiently large number of additional Apache Lucene shards, so that all queries again run with the required level of performance.
Books on Elasticsearch exist in both print and electronic form, but unfortunately, I can't really recommend any of them. That said, the tutorial [10] can be a help, and volunteers will answer questions on Stackoverflow.com.
Mike Schilli
Mike Schilli works as a software engineer with Yahoo! in Sunnyvale, California. He can be contacted at mschilli@perlmeister.com. Mike's homepage can be found at http://perlmeister.com.
Infos
[1] Elasticsearch download site: http://www.elasticsearch.org/overview/elkdownloads/
[2] Elasticsearch-1.03: http://search.cpan.org/~drtech/Elasticsearch-1.03/
[3] ElasticSearch-0.66: http://search.cpan.org/~drtech/ElasticSearch-0.66/
[4] Listings for this article: ftp://ftp.linux-magazin.com/pub/listings/magazine/162
[5] "Card Trick" by Mike Schilli: http://www.linux-magazine.com/w3/issue/95/072-076_perl.pdf
[6] "Don't Blame the Gardener" by Mike Schilli: http://w3.linux-magazine.com/issue/77/Perl_Linux-based_Gardening.pdf
[7] Elasticsearch documentation: http://www.elasticsearch.org/resources/
[8] Elasticsearch geo-distance filter: http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/query-dsl-geo-distance-filter.html
[9] "New Bay Bridge spanning San Francisco Bay finally finished" by Mike Schilli: http://usarundbrief.com/103/index-en.html
[10] Elasticsearch tutorial: http://joelabrahamsson.com/elasticsearch-101/