
Add loext:hyphenation-keep-line to the ODF export/import to support HyphenationKeepLine, i.e. MS DOCX compatibility option useWord2013TrackBottomHyphenation=false, where not the full hyphenated line, but only the hyphenated word is shifted to the next text block. Add unit tests for hyphenation-keep-type="page" and "spread" with hyphenation-keep-line="true" support. Follow-up to commit3e02ffb76c
"tdf#i165354 sw offapi DOCX: implement HyphenationKeepLine – part 1". Fix unit test testTdf132599_spread, which loaded tdf132599_page.fodt instead of tdf132599_spread.fodt (duplicating testTdf132599_page). Restore unit test testTdf160518_auto_in_text_body_style, which was removed by accident in commit3e02ffb76c
. Silence the linguistic warning of HyphKeepLine, according to the commitd25de4a046
"silence unknown property 'HyphNoLastWord'... etc warnings". Change-Id: Ibb9643f8e20ee2a1456c4d62b901e4c66b57c341 Reviewed-on: https://gerrit.libreoffice.org/c/core/+/182424 Reviewed-by: László Németh <nemeth@numbertext.org> Tested-by: Jenkins
ODF Import and Export Filter Logic
The main library "xo" contains the basic ODF import/export filter
implementation for most applications. The document is accessed
via its UNO API, which has the advantage that the same import/export
code can be used for text in all applications (from/to Writer/EditEngine).
The filter consumes/produces via SAX UNO API interface (implemented in
"sax"). Various bits of the ODF filters are also implemented in
applications, for example [git:sw/source/filter/xml]
.
There is a central list of all element or attribute names in
[git:include/xmloff/xmltoken.hxx]
. The main class of the import filter
is SvXMLImport, and of the export filter SvXMLExport.
The Import filter maintains a stack of contexts for each element being read. There are many classes specific to particular elements, derived from SvXMLImportContext.
Note that for export several different versions of ODF are supported, with the default being the latest ODF version with "extensions", which means it may contain elements and attributes that are only in drafts of the specification or are not yet submitted for specification. Documents produced in the other (non-extended) ODF modes are supposed to be strictly conforming to the respective specification, i.e., only markup defined by the ODF specification is allowed.
There is another library "xof" built from the source/transform directory, which is the filter for the OpenOffice.org XML format. This legacy format is a predecessor of ODF and was the default in OpenOffice.org 1.x versions, which did not support ODF. This filter works as a SAX transformation from/to ODF, i.e., when importing a document the transform library reads the SAX events from the file and generates SAX events that are then consumed by the ODF import filter.
OpenOffice.org XML File Format
There is some stuff in the "dtd" directory which is most likely related to the OpenOffice.org XML format but is possibly outdated and obsolete.
Add New XML Tokens
When adding a new XML token, you need to add its entry in the following three files:
[git:include/xmloff/xmltoken.hxx]
[git:xmloff/source/core/xmltoken.cxx]
[git:xmloff/source/token/tokens.txt]