Presentation - Localization World

Download Report

Transcript Presentation - Localization World

Integrating Open-Source
Solutions in your Processes
Barcelona – June 2011
Stefan Pries - Andrä AG
Yves Savourel - ENLASO
Why FOSS Should Interest You?
Many Proven Solutions
• Open source products widely used in many domains
o The LAMP stack (Linux, Apache, MySQL and
PHP/Perl/Python)
o Lucene, Drupal, Zope, SVN, Firefox, OpenOffice, etc.
Why not use the ones in the translation domain too?
Why FOSS Should Interest You?
They are used already
• FOSS projects use FOSS translation tools
• Some LSPs do use open-source translation tools
• Some translators as well (and not just to translate
FOSS projects)
Why FOSS Should Interest You?
Maturity
• Some projects have been established for a long time
• User-driven improvements over the years
Why FOSS Should Interest You?
Innovation
• The Bazaar model often leads to
more creative and innovative solutions
• Example: Collaborative translation in FOSS
long before it was in the "industry"
Why FOSS Should Interest You?
Interoperability
• Important part of FOSS projects, because they often are
meant to be used along with others FOSS projects
•
•
•
•
•
Wide support for TMX, PO
Increasing support for XLIFF, SRX
Web services
Documented APIs
Command-line interfaces
Some Advantages and Drawbacks
+ You don't have to pay for software licenses
- You still have to spend time to install and maintain things
- There is no technical support
+ There is almost alway a users group that can help (for free)
and some projects have a commercial side too
- What if the main developers of the project leave?
+ You still have the source and possibly a community
- Developers may not implement what you need
+ Money can be an incentive, or branching a solution
Let's Explore...
• Only active projects related to translation
• Not even all of them, just a selection
Categories:
•
•
•
•
•
Libraries and tools sets
Translation management systems
Translation editors
Machine translation systems
Checkers and QA tools
Demonstrations
The best way to discover the tools is to use them.
Two scenarios:
1. Integration with proprietary system
2. Integration with other open-source applications
Scenario 2
• Archibald the PM uses Rainbow to pre-translate MIF, Word
and XML files from a GlobalSight TM and send them for
translation to a Transifex server.
• Pedro the translator uses his own Translate Toolkit TM
server to provide additional leverage, then uses OmegaT,
an Okapi plugin, Hunspell, LanguageTool and Apertium
to finish the translation of the Transifex files.
• Archibald post-processes the translated documents.
9 different open-source tools
And Many More...
•
•
•
•
•
•
•
•
•
•
•
•
•
Anaphraseus (translation editor in OpenOffice)
Boltran (online translation editor)
Bitext2Tmx (aligner)
Gaupol (sub-title translation)
KBabel (PO editor)
Lokalize (translation editor)
M4Loc (Moses integration tool)
POEdit (PO editor)
Open Language Tools (XLIFF editor)
OpenLogos (MT system)
Qt Linguist (translation editor)
TinyTM (TM server)
Zanata (translation management system)
…Etc.
Summary
• Many different tools available for several tasks
o Editing
o Quality assurance
o Project management
o Translation assistance (Terminology, TM, MT, etc.)
• Open standards allow complementing existing
processes with open-source solutions
• Entire enterprise processes based
on open-source tools are possible
• Not a solution for everyone,
but one worth exploring
For More Information
•
•
•
•
•
•
•
•
•
•
Apertium – apertium.org
GlobalSight – globalsight.com
Hunspell – hunspell.sourceforge.net
LanguageTool – languagetool.org
Moses – statmt.org/moses
Okapi Framework, Rainbow & CheckMate – okapi.opentag.com
OmegaT – omegat.org
OpenTM2 – opentm2.org
Transifex – transifex.org
Translate Toolkit, Pootle & Virtaal – translate.sourceforge.net
FLOSS Manual on open translation tools:
en.flossmanuals.net/open-translation-tools/
Thanks!
Any Questions?
[email protected]
[email protected]
Credits for this presentation:
•
•
•
•
Indifex provided the Transifex server access
Jordi Serratosa (@jordis) did the Catalan translations
Universitat d'Alacant provided the Apertium server access
Welocalize provided the GlobalSight server access
And many thanks to all the developers of the various open-source
projects for their hard work.
Appendix
Libraries and Tools Sets
Started in…
License
Platform /
Language
Remarks
Translate
Toolkit
October
2003
GPL
Independent /
Python
Initial build around the Gettext
facilities and PO files,
Working with more formats now.
Works best with software strings (vs.
documentation)
Integration: Python API, commandline toolset, Web service (tmserver)
Okapi
Framework
Opensourced in
November
2005
LGPL
Independent /
Java
Components: filters, connectors,
steps, etc.
Rainbow and Tikal tools
Integration: Java API, command-line
tools, Rainbow functionalities
available through Longhorn Web
services
Translate Toolkit
Okapi Framework
Translation Management Systems
Pootle
(Web-based
server
application)
Transifex
(Web-based
server
application)
GlobalSight
(Web-based
server
application)
Started in…
License
Platform /
Language
Remarks
December
2004
GPL
Independent /
Python
Existed long before "crowdsourcing" was a term.
Example of use: the translation
system of Launchpad, the
backbone of Ubuntu translations.
Summer
GPL
2007 as a
GSoC project
Independent /
Python
Mainly PO and TS files
Integration with many SCMs
Integration: API, Web services,
command-line client.
Opensourced in
January
2009
Apache 2.0 Windows,
Linux / Java
Complex system, many features
Integration with TDA TMs
Integration: Client tool, Web
service API
Pootle
Transifex
GlobalSight
Translation Editors
Started in…
License
Platform /
Language
Remarks
OmegaT
2002
GPL
Independent /
Java
Very active community
Other projects use OmegaT at
their core: e.g. Boltran, Benten
Integration: Plugin API,
commend-line mode.
Virtaal
2007
GPL
Independent /
Python
Uses Translate Toolkit
OpenTM2
Opensourced in
June 2010
EPL
Windows
OS/2 type user-interface
LISA wanted to make it a way to
provide reference
implementation for standards
OmegaT
Virtaal
OpenTM2
Machine Translation Systems
Started in…
License
Platform /
Language
Remarks
Apertium
2005
(OpenTrad
project)
GPL
Linux,
Rule-based system
Windows / C++ Some of the language pairs:
Catalan<->Spanish,
Spanish<->English,, etc.
Integration: Web services,
command-line
Moses
2006
LGPL
(EuroMatrix,
etc.)
Linux,
Statistical-based system
Windows / C++ Other projects are related:
- Moses For Localization (M4Loc)
- Moses for Mere Mortals
- Do Moses Yourself CE
Integration: Web services,
command-line.
Apertium
Moses
Checkers and QA Tools
Started
in…
License
Platform /
Language
Remarks
LanguageTool May 2004
LGPL
Independent /
Java
Grammar and QA checker
Rule based, extendable
Has defaults for many languages
Integration: Java API, Web services
Hunspell
July 2005
GPL, LGPL,
MPL
Independent /
C++
Spell-checker
Used in many other tools
Integration: API
POFilter
2004
GPL
Independent /
Python
Part of the Translate Toolkit
Software string verification
Integration: Command-line
CheckMate
2010
LGPL
Independent /
Java
Part of the Okapi project
Integration: Java API
LanguageTool
Hunspell
PO Filter
CheckMate