One of the most functional libraries to read and write .docx (a.k.a. Microsoft Word documents or ECMA-376 Office Open XML) files in Go.
This is a variant optimized and expanded by fumiama. The original repo is gonfva/docxlib.
As part of my work for Basement Crowd and FromCounsel, we were in need of a basic library to manipulate (both read and write) Microsoft Word documents.
The difference with other projects is the following:
- UniOffice is probably the most complete but it is also commercial (you need to pay). It also very complete, but too much for my needs.
- gingfrederik/docx only allows to write.
There are also a couple of other projects kingzbauer/docx and nguyenthenguyen/docx
gingfrederik/docx was a heavy influence (the original structures and the main method come from that project).
However, those original structures didn't handle reading and extending them was particularly difficult due to Go xml parser being a bit limited including a 6 year old bug.
Additionally, my requirements go beyond the original structure and a hard fork seemed more sensible.
The plan is to evolve the library, so the API is likely to change according to my company's needs. But please do feel free to send patches, reports and PRs (or fork).
In the mean time, shared as an example in case somebody finds it useful.
The Introduction above is copied from the original repo. I had evolved that repo again to fit my needs. Here are the supported functions now.
- Parse and save document
- Edit text (color, size, alignment, link, ...)
- Edit picture
- Edit table
- Edit shape
- Edit canvas
- Edit group
go run cmd/main/main.go -u
And you will see two files generated under pwd
with the same contents as below.
p1 | p2 |
go get -d github.com/fumiama/go-docx@latest
package main
import (
"os"
"strings"
"github.com/yangge2333/go-docx"
)
func main() {
w := docx.NewA4()
// add new paragraph
para1 := w.AddParagraph()
// add text
para1.AddText("test").AddTab()
para1.AddText("size").Size("44").AddTab()
f, err := os.Create("generated.docx")
// save to file
if err != nil {
panic(err)
}
_, err = w.WriteTo(f)
if err != nil {
panic(err)
}
err = f.Close()
if err != nil {
panic(err)
}
}
package main
import (
"fmt"
"os"
"strings"
"github.com/yangge2333/go-docx"
)
func main() {
readFile, err := os.Open("file2parse.docx")
if err != nil {
panic(err)
}
fileinfo, err := readFile.Stat()
if err != nil {
panic(err)
}
size := fileinfo.Size()
doc, err := docx.Parse(readFile, size)
if err != nil {
panic(err)
}
fmt.Println("Plain text:")
for _, it := range doc.Document.Body.Items {
switch it.(type) {
case *docx.Paragraph, *docx.WTable: // printable
fmt.Println(it)
}
}
}
AGPL-3.0. See LICENSE