LibreOffice and Apache OpenOffice at the command line
By the Script
Your favorite office suites have a surprisingly active life at the command line, and various supplementary scripts can help you perform some functions more easily there than on the desktop.
LibreOffice and Apache OpenOffice are primarily desktop applications. However, you might not know that the way these applications open is determined by command-line options. You can also print and convert files from the command line, in some cases more easily than from the desktop. Additionally, of the various supplementary scripts that have been written for their predecessor OpenOffice.org, at least two sets of utilities continue to be useful for manipulating their files from the command line.
You may need to search for these utilities, because they are not packaged for all distributions. Still, whether you are a home user or a developer who wants to summon LibreOffice as part of a more elaborate script, these tools can be worth hunting down. Like many command-line tools, they allow you to perform functions more easily than on the desktop or do things that are not even options in the editing window.
LibreOffice and OpenOffice Command Options
Both LibreOffice and Apache OpenOffice start with the command
soffice, which is short for StarOffice, the name of the application before Sun Microsystems released the code in July 2000. Some of the options listed by the
--help option are obsolete, such as
--nofirststartwizard, which refer to features no longer in recent versions. Others, like
--display, are useful only on Linux, but many of the others are useful on any platform.
The most obvious way to modify this basic command is with options that start with a specific module:
--writer for Writer,
--calc for Calc,
--draw for Draw,
--impress for Impress, and
--math for Math. Less obvious is
--web, which opens Writer in the Web Layout view, and
--global, which opens a new master document in Writer.
If you want to start work directly, you can open an existing file with
-o FILENAME or a read-only copy with
--view FILENAME. Alternatively, if you know the template you want to use,
-n TEMPLATE opens a new document using the template specified. Similarly, you can start a slide show directly – instead of opening it to edit – with
Developers might be interested in
--headless, which opens the application without a GUI, or at least
--invisible, which opens the application so its features are open only via the API. All users might be interested in the
--nologo option to start LibreOffice or OpenOffice without a splash screen or
--norestore to disable file recovery after a crash to ensure a slightly faster – or at least less annoying – startup time.
Other options let you print files without opening the editing window. Using
-p, you can print a space-separated list of files to the default print. Alternatively, you can specify a printer with
--pt PRINTERNAME to print a space-separated list of files wherever you want. Neither returns any output to the command-line prompt.
Still another option is
--print-to-file PRINTERNAME OUTPUTDIRECTORY FILES, which is useful for generating PostScript files (as long as the printer supports PostScript, of course). Interestingly enough,
--print-to-file supports space-separated lists of all parameters, which means that different printers can print different files to different directories all at the same time.
The command line also supports conversion of files from one supported format to another with
--convert OUTPUTFILEEXTENSION OUTPUTDIRECTORY FILES. You can use any file extension listed in the Save or Save As dialogs for LibreOffice or OpenOffice. If no output directory is specified, then the output files are created in the present working directory. This option is the equivalent of File | Wizards | Document Converter.
Over the years, some of the utilities written for OpenOffice.org or its successors have become obsolete as features and circumstances have changed. However, one utility that has proved long-lived is OpenOffice.org Python (OOoPy). OOoPy  is a collection of Python scripts for working with LibreOffice and OpenOffice files. It is best run with Python 2.5 or higher, although you can run it with Python 2.3 plus the ElementTree library .
To use the scripts, download the OOoPy file and unarchive it. From the directory in which you placed the file, run the command:
python setup.py install
~/.local/share/ooopy will be created. You can either run the scripts from the
/bin subdirectory of the directory to which you downloaded, or you add the directory to your environment.
Some of the scripts, such as ooo_fieldreplace, ooo_mailmerge, and ooo_prettyxml, are of interest mainly to Python developers and are also seriously underdocumented. The remainder, with their triple Os, underscores, and help confined to the bare structure displayed by
--help, take careful typing, but they are well worth the effort.
For example, if you want to extract text from a document, you can use
ooo_as_text with the syntax:
ooo_as_text --output-file OUTPUTFILE INPUTFILE, INPUTFILE
Most likely, you will want to add
-n to add a line to mark the end of each paragraph. Other formatting will be lost, but the output can be read in any text editor. The script may be able to recover a corrupted file, and, in some cases, this may be the easiest way to begin converting a file to another format.
ooo_as_text script is also used alongside GNU grep to run
ooo-grep does not give you all the options of grep, but it does allow you to search files with all the regular expressions that grep supports. Its results are the same as those of
grep -l; that is, it returns the names of files that contain matches.
Another useful script is
ooo_cat, which combines a comma-separated list of files into a single file. Aside from the command,
ooo_cat has the same structure as
Writer2LaTeX  has outgrown its name. Originally, it was a Java program for converting documents produced in LibreOffice's or OpenOffice's word processor to LaTeX, which many users prefer for layout. Now, however, the program also includes options for converting bibliographic data to BibTeX format, Writer files to ePub, and Writer and Calc files to XHTML. This growth is reflected in a 73-page manual . The basic command structure for Writer2LaTeX is:
w2l INPUTFILE OUTPUTFILE
By default, the conversion is to LaTeX. However, you can specify formats with the options
-epub, and others (Figure 1). Unlike the OOoPy scripts, Writer2LaTeX does display status reports at the command line.
The formatting of LaTeX files specifically can also be detailed. With
-ultraclean, most formatting is ignored. Using
-clean, hyperlinks, color, and limited formatting is preserved. Additionally, you can specify whether the LaTeX file is optimized to produce a PDF file for viewing on the screen with
-pdfscreen or for printing with
Similarly, conversions to XHTML can be adjusted by selecting a template with
--template FILE or a style sheet with
--stylesheet FILE. The documentation also includes more than 45 pages about creating style sheets and templates to create output files that are formatted with more precision.
Buy this article as PDF
Kernel king admits his tone has alienated volunteers, but says the demands of the process require directness.
New flaw in an old encryption scheme leaves the experts scrambling to disable SSL 3
Lennart Poettering wants to change the way Linux developers talk to each other.
Enterprise giant frees itself from ink and home PCs (and visa versa).
Mozilla’s product think tank sinks silently into history.
TODO group will focus on open source tools in large-scale environments.
New tool will look like GParted but support a wider range of storage technologies.
New public key pinning feature will help prevent man-in-the-middle attacks.
Carnegie Mellon researchers say 3 million pages could fall down the phishing hole in the next year.
The US government rolls new best-practice rules for protecting SSH.