OTRS plugin for Supybot

Posted on: Sun 18 May 2014

Estimated read time: 2 minutes

OTRS is quite different to Bugzilla (what we use for upstream LibreOffice development for quite some time). On the plus side, e.g. it has strong support for multiple customers. OTOH, it deals with tickets instead of bugs, and sadly it doesn’t have a single identifier for tickets. It has a ticket number (which by default even includes the date), which is searchable, and it has a ticket ID, which is used for URL’s.

In case of Bugzilla, you can easily lookup "bug#12345" in Firefox. Create a bookmark with the following properties:

Location: https://localhost/bugzilla/show_bug.cgi?id=%s
Keyword: bug

and then you can just copy&paste bug#12345 to Firefox, replace the # with a space, and Firefox will do the right thing.

Unfortunately (due to the above detailed reasons), this is not possible with OTRS. So I decided to write a Supybot plugin that can notice "bug#12345" on an IRC channel, and give you the clickable URL (after finding out the ticket ID from the ticket number).

The result is available on GitHub, it looks like this:

09:58 < vmiklos> bug#1000068
09:58 < supybot> https://localhost/otrs/index.pl?Action=AgentTicketZoom;TicketID=73

Given that I found no relevant hits when searching for supybot otrs, I hope this code may be useful for others as well.

Thanks to James Scott for his YouTube plugin that helped to quickly figure out the relevant Supybot API’s.

Improved support for text frames with relative sizes in LibreOffice Writer

Posted on: Wed 02 April 2014

Estimated read time: 2 minutes

https://lh6.googleusercontent.com/-YDWSLjQcV1U/UzxP5AJhnyI/AAAAAAAAEIo/8uurEY7s90k/s400/

When using text frames in Writer, you can always choose if you set an absolute size for it or you set a relative one. Oddly enough, in case of relative sizes, it wasn’t entirely clear what 100% percent means. With a bit of searching, the help says "it’s the page text area", which in practice means the page size, excluding the margins.

And that’s where the problem lies: in many cases (importing foreign formats, cover page of a document, etc.) you want to have a textframe which is 100% wide, compared to the full page size, including margins. It was already possible previously to work this around by manually specifying the same size what was used for page size, but that’s ugly, you duplicate the setting at two places.

As you can see on the above screenshot, in LibreOffice 4.3, I now implemented this as a new option, you can choose what 100% means for both width and height. File filters are also updated accordingly: in case of ODF an extension is proposed, and also DOCX and RTF filters are updated, where the file format already supported this feature.

For the curious ones, the feature is in master for almost two months now, but I only implemented my favorite part — RTF filter — only last week, that’s the "news" here. ;-)

If you want to try these out yourself, get a daily build and play with it! If something goes wrong, report it to us in the Bugzilla, so we can try fix it before 4.3 gets branched off. Last, but not at least, thanks for CloudOn for funding this improvement! :-)

DOCX import progressbar in LibreOffice Writer

Posted on: Sat 22 March 2014

Estimated read time: 1 minutes

https://lh5.googleusercontent.com/-sW_Ccet7HMU/Uy2hcmf7ZsI/AAAAAAAAEHg/IGwPJI6h6vE/s400/

I’m sure in this case a few words are worth more than the above picture, so let me describe what you see above. :-)

In case of opening an ODT, DOC or RTF document in LibreOffice Writer, you already got some feedback on where the importer is, in case the process needed more time than what you feel "instant". However, this wasn’t supported for DOCX. According to git blame, I added this to my todo on 2012-10-29, and a few months later also a bugreport was opened, requesting the same, but up to yesterday, nothing changed. However, now I’ve implemented this on master, it’ll be part of the 4.3 release.

Back to where I started, what you actually see there is when LibreOffice is in the middle of the import process of the Holy Bible in DOCX format, which takes around 12 seconds on my machine. One could say that speed up quite acceptable for that amount of data, but with a progressbar, it’s definitely better. ;-)

Death of doctok in LibreOffice

Posted on: Sun 09 March 2014

Estimated read time: 3 minutes

https://lh5.googleusercontent.com/-IumGq3jzNdI/UxxG3Deg6QI/AAAAAAAAEFY/Ij7oMS7YyoY/s300/

Last year in September we decided to get rid of the writerfilter-based DOC tokenizer, and I volunteered to actually do this. As cleanups in general have a low priority, I only progressed with this slowly, though yesterday I completed it, that’s why I’m writing this post. :-)

Some background: the writerfilter module is responsible for RTF and DOCX import in Writer. As the above picture shows, the currently used DOC import is independent from it, and there was also an other DOC import filter, that was in writerfilter which was disabled at runtime. As I don’t like duplication, I examined the state of the two filters, and the linked minutes mail details how we decided that the old filter will stay, and we’ll get rid of the writerfilter one. It’s just a matter of deleting that code, right? :-) That’s what I thought first. But then I had to realize that the architecture of writerfilter is a bit more complex:

https://lh3.googleusercontent.com/-vBMsCsYLRZE/UxxG3PsuI2I/AAAAAAAAEFc/Ly8sQ0ydY1A/s400/

It has the following components:

the dmapper (domain mapper), that handles all the nasty complexities of mapping Word concepts to Writer concepts (think of e.g. sections ↔ page styles)
one tokenizer for each (RTF, DOCX, DOC) format

The traffic between the tokenizers and dmapper is called tokens. Naturally it’s not enough that tokenizers send and dmapper receives these tokens, they should be defined somewhere as well. And that’s where I realized this work will take a bit more time: instead of having a single token definition, actually the ooxml tokenizer defined its own grammar, and doctok also defined two additional grammars. And of course dmapper had to handle all of that. ;-) Given that OOXML is a superset of the DOC/RTF format, it makes sense to just use the ooxml grammar, and get rid of the other two.

Especially that — by now you probably found this out — if I wanted to kill doctok, I had to kill the sprm and rtf grammars as well. Otherwise just removing doctok would break the RTF and DOCX import as well, as those also used the rtf/sprm grammars.

So at the end, the cleaned up architecture now looks like this:

https://lh5.googleusercontent.com/-Iki8jKN61vE/UxxG3FaQc2I/AAAAAAAAEFQ/hWFsR20qB5o/s400/

And that has multiple advantages:

It removes quite some code: In libreoffice-4-1, the doctok was 78849 (!) lines of code (well, part of that was XML data, and some scripts generated C++ code from that).
dmapper now doesn’t have to handle the rtf and sprm grammars anymore, so now there is a single place in dmapper that handles e.g. the italic character property.
Smaller writerfilter binary for the end user: even if doctok wasn’t enabled at runtime, it was shipped in the installation set.
Hopefully it’s now a bit more easy to understand writerfilter: at least e.g. if you want to look up the place where dmapper handles the character bold ("b") XML tag of OOXML, you don’t have to know that the binary DOC equivalent of that is sprmCFBold, just because we have an unused DOC tokenizer there as well. :-)
Given that DOC and RTF formats are a dead end, I think it’s a good thing that in writerfilter now the grammar is OOXML (that keeps introducing new features), rather than some dead format. ;-)

LibreOffice Writer now supports nested comments in its DOC/RTF filters

Posted on: Fri 28 February 2014

Estimated read time: 1 minutes

https://lh3.googleusercontent.com/-FdnXBOBQWjc/UxA1LD0-fyI/AAAAAAAAEDk/2hYmN8ASkxA/s400/comments-nested.png

If you ever tried to use nested comments in Writer (make a selection, Insert → Comment, then make an overlapping selection, and do it again), you may have noticed that only the ODF filter can load and save such a document properly. Recently I have improved this situation a lot. Motivated by seeing this is now supported in the DOCX import filter, I now added support for this also to the DOCX export, RTF import/export and binary DOC import/export filters.

If you want to try this out, core.git has a ODT and DOC samples to play with.

InteropGrabBag in LibreOffice Writer

Posted on: Thu 06 February 2014

Estimated read time: 1 minutes

https://lh5.googleusercontent.com/-XoQPls8ulH4/UvOP_KUNG8I/AAAAAAAAEA0/BSJLDOfvLUA/s400/interopgrabbag-fosdem-brussels-2k14-0.jpg

I’ve arrived home yesterday from Brussels where I presented at FOSDEM 2014, in the Open document editors devroom.

We also had a Hackfest, kindly hosted by Betacowork on Monday and Tuesday.

Here are a few talks I enjoyed, not counting the LibreOffice ones:

Simplifying reuse with metadata support in ODF and plugin APIs — finally a simple way to copy&paste images from Firefox to LibreOffice with source attribution.
Combining the power of Valgrind and GDB — useful to know about this possibility
Two uses cases for the clang C++ parser: Online Code Browser and Qt moc Replacement. — maybe the first could replace our crappy Java-based opengrok? We’ll see.

I was also happy to meet Jacobo, Matus, Tim and Tomaž finally personally. :-)

Quite some other slides are now available on Planet, don’t miss them. I also took some pictures, available here, including photos of all speakers in our devroom.

OOXML shape improvements in LibreOffice Writer 4.3

Posted on: Sat 11 January 2014

Estimated read time: 2 minutes

Although LibreOffice 4.2.0 is not yet released, it was already branched off from master in November last year, and improvements for the next release are already cooking in master. One of these will be a major improvement of shape handling in the DOCX import/export filter.

Some background: when DOCX was initially introduced, it still used VML (which is in short an XML equivalent of the binary shape format), and only Word 2010 started to write shapes using drawingML. Given that Word still understands VML, it wasn’t urgent for us to write shapes using the drawingML markup. As for import, Word still writes an approximate version of the shape in VML as a fallback — that’s what we read till now. Needless to say, newer drawingML features have no VML equivalent so with time it became more and more important for us to finally read and write shapes in DOCX using drawingML, which just happened in Writer.

I’m posting here a few screenshots showing the improvements I’ve implemented. Note that final 4.3 is still far from being released, so this is not a complete list. :-) In each case I’m providing a screenshot showing how it looked (at the end of an import/export/import again roundtrip) before, how it looks now in 4.3 and the reference layout. Click on the images to get a larger image:

document with different colors (test doc):

OK, this has four pictures: before, now, Word 2007 and Word2010. As you can see now we’re now on par with Word 2010. ;-)

document with textboxes inside a group shape (test doc):

document with textbox, having a special RelOrientation (test doc):

document with a shape having a custom adjustment (test doc):

document with different colors (test doc):

A LibreOffice Draw GEDCOM import filter

Posted on: Sat 28 December 2013

Estimated read time: 1 minutes

https://lh4.googleusercontent.com/-4110I0vkRE4/Ur8-uJ6YcII/AAAAAAAAD44/u1pNo4z1y-M/s400/screenshot.png

You may remember that I wrote about my little ged2dot project about two months ago. It’s a Python script that can be used from commandline with quite some options. I’m happy to announce that there is a GUI for it now, in the form of a LibreOffice Draw import filter. If you open this GEDCOM file, you’ll see something similar to the above image. You can simply build the .oxt from the Git repo, or just get it from the LibreOffice Extensions site.

Technical details for the interested readers: the import filter just glues together existing pieces. First it runs ged2dot, then dot to generate an SVG. To keep things simple, then the filter "inlineizes" all the included images, finally the already existing SVG import filter does the real work. I find it elegant that each step works from and to the memory, and not ugly temporary files. :-) It works on Linux and Windows, feedback on if it works on Mac is appreciated.

Also, if you ask why this is an extension, not part of LibreOffice core: I guess most users are not interested in building family trees; also being someone who works on the core most of the time, I wanted to try out how implementing an import filter as an extension (filter detection, import options dialog, etc.) works. ;-)

Software Freedom Day Hungary, 2013

Posted on: Sat 30 November 2013

Estimated read time: 1 minutes

https://lh5.googleusercontent.com/-_PCiCvnpRa8/UpoG4lCMFFI/AAAAAAAADyI/0n0ZpQsXREQ/s400/sfd-plakat-2013-kicsi.png

This year’s Software Freedom Day event was held yesterday at Szeged, here at Hungary. I gave a talk about the news of LibreOffice 4.1 / 4.2 (slides), and I was happy to notice there were other LibreOffice-related talks as well:

Péter Mátó was talking about central configuration management, the LibreOffice-related bit is this commit;
László Németh was talking about a LibreOffice extension, but in practice he shared quite some details about how to contribute (and his usual topics like LibreLogo ;-) )

I would like to thank the organizers all their work, it was a pleasure to attend this event! :-)

LibreOffice Writer can now write .dot files

Posted on: Tue 12 November 2013

Estimated read time: 1 minutes

https://lh6.googleusercontent.com/-MFCwDOTVK1o/UoHtE8V1k8I/AAAAAAAADuU/dKjop88lNxo/s400/dotexport.png

It was pointed out that LibreOffice Writer is only able to open, not save .dot (Word binary template) files. From mso-dumper experience, I knew that the difference is quite small between .doc (which we already export) and .dot files. The feature freeze for LibreOffice 4.2 is here next week, so why not adding this feature before that deadline? You can see above the result. :-)

For the ones who love our --convert-to commandline switch, an extra fix was necessary to make --convert-to dot work as well, both commits are now in master. You can try it out yourself using a daily build or just wait for the libreoffice-4-2 branchoff and the first beta from that branch.

Happy templating! ;-)