Putting free digital assistants to the test
Out of the Box
Sirius is not being further developed, which is reason enough to cast a glance at its successor, Lucida [9]. Since compiling from the sources [17] did not succeed, the test team opted for Docker containers that had a trimmed-down demo version. After installing docker
and docker-compose
, you should run the following two commands:
sudo docker pull claritylab/lucida:latest sudo docker pull claritylab/lucida-asr
In the process, around 17GB copies to the disk. Next, access the main container with this command:
sudo docker run -i -t claritylab/lucida /bin/bash
The docker-compose.yml
file, which is necessary to start the demo version on the host system, is in the root directory. You are advised to copy its content onto the clipboard and paste it into a new docker-compose.yml
file after leaving the Lucida container with exit
; then, start all the Lucida services from the same directory:
sudo docker-compose up
The developers recommend using the Lucida web interface (http://localhost:8081) with the Chrome browser or free Chromium variant. A Wikipedia data dump is not present. The web front end instead asks for text to form the evidence base for the question-answering system. You enter this yourself and then click on Submit
.
Lucida then asks for the access privileges for the microphone, because the demo version exclusively communicates by speech. You click on the microphone symbol to activate this, and speak your question. Kaldi operates in the background and attempts to understand the text. The result is shown in the speech bubble on the left, with the answer opposite on the right (Figure 6).
As previously seen in the Sirius experiment with Kaldi (Table 1), dictating a sentence and having it correctly recognized was only possible with great difficulty. In this Docker edition, it is not possible to send questions to the system from your keyboard, and image recognition is likewise lacking.
A process to exchange the Kaldi speech recognition back end is not provided, meaning that the test team was also unable to experiment with PocketSphinx or Sphinx4. In the GitHub repository, the developers state that they will publish the next generation of Lucida toward the end of summer 2016 [18]. Along with a new command center, they should especially enclose a better question-answering system and a guide for how users can exchange individual components.
Future Helpers
Sirius and Lucida are not suitable for serious use to support you in everyday life, so Linux users will probably have to wait a while before they can get reasonable answers or real help from a digital assistant. The performance of the free programs – still a long way behind that of commercial alternatives – presumably is not because of an inferior quality of software, however. Firms such as Google or Apple have undoubtedly invested a large amount of money into training efforts. In the case of speech recognition, for instance, training consists of tedious tasks providing hours of recordings and phonetic transcription.
In principle, as a user, you can also train the free components and accustom them to your voice. However, linguistic knowledge and, most of all, staying power are needed to make that happen.
Infos
- Sirius: http://sirius.clarity-lab.org/sirius
- Hauswald, Johann, Michael A. Laurenzano, Yunqi Zhang, et al. "An open end-to-end voice and vision personal assistant and its implications for future warehouse scale computers." In: Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS, 2015), New York: ACM, pp. 223-238.
- CMU Sphinx: http://cmusphinx.sourceforge.net
- Kaldi: http://kaldi-asr.org
- OpenCV: http://opencv.org
- OpenEphyra: http://www.ephyra.info
- Caffe: http://caffe.berkeleyvision.org
- Sirius seminar during ASPLOS-20: http://sirius.clarity-lab.org/tutorial
- Lucida: http://lucida.ai
- Sirius downloads: http://sirius.clarity-lab.org/downloads/#sirius
- Lemur's Indri: http://www.lemurproject.org/indri.php
- OpenEphyra architecture: https://mu.lti.cs.cmu.edu/trac/Ephyra/wiki/Docs/ArchitectureOverview
- Status of OpenEphyra: https://github.com/claritylab/lucida/issues/89
- Freeimages: http://www.freeimages.com
- Google Goggles: http://www.google.com/mobile/goggles
- CamFind: http://camfindapp.com
- Lucida GitHub repo: https://github.com/claritylab/lucida
- Next Lucida version: https://github.com/claritylab/lucida/issues/116
« Previous 1 2 3
Buy this article as PDF
(incl. VAT)
Buy Linux Magazine
Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters
Support Our Work
Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.
News
-
Endless OS 6 has Arrived
After more than a year since the last update, the latest release of Endless OS is now available for general usage.
-
Fedora Asahi 40 Remix Available for Macs with Apple Silicon
If you've been anticipating KDE's Plasma 6 for your Apple Silicon-powered Mac, then you're in luck.
-
Red Hat Adds New Deployment Option for Enterprise Linux Platforms
Red Hat has re-imagined enterprise Linux for an AI future with Image Mode.
-
OSJH and LPI Release 2024 Open Source Pros Job Survey Results
See what open source professionals look for in a new role.
-
Proton 9.0-1 Released to Improve Gaming with Steam
The latest release of Proton 9 adds several improvements and fixes an issue that has been problematic for Linux users.
-
So Long Neofetch and Thanks for the Info
Today is a day that every Linux user who enjoys bragging about their system(s) will mourn, as Neofetch has come to an end.
-
Ubuntu 24.04 Comes with a “Flaw"
If you're thinking you might want to upgrade from your current Ubuntu release to the latest, there's something you might want to consider before doing so.
-
Canonical Releases Ubuntu 24.04
After a brief pause because of the XZ vulnerability, Ubuntu 24.04 is now available for install.
-
Linux Servers Targeted by Akira Ransomware
A group of bad actors who have already extorted $42 million have their sights set on the Linux platform.
-
TUXEDO Computers Unveils Linux Laptop Featuring AMD Ryzen CPU
This latest release is the first laptop to include the new CPU from Ryzen and Linux preinstalled.