Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Speaker Deck
PRO
Sign in
Sign up
for free
Pure Python Microsoft Office Word support
Mike MacCana
April 19, 2012
Technology
1
170
Pure Python Microsoft Office Word support
Lighting talk from PyCon 2010 on Python DocX.
Mike MacCana
April 19, 2012
Tweet
Share
More Decks by Mike MacCana
See All by Mike MacCana
mikemaccana
0
61
mikemaccana
2
79
mikemaccana
1
140
mikemaccana
0
81
mikemaccana
0
65
mikemaccana
0
82
mikemaccana
1
120
mikemaccana
1
100
mikemaccana
2
65
Other Decks in Technology
See All in Technology
ryusa
2
350
_kensh
1
210
shomaekawa
3
1.3k
bells17
1
690
papix
0
270
yunon_phys
1
450
keiya01
21
8.6k
sumi
0
590
sasakendayo
2
450
terurou
0
240
meteatamel
0
430
shirayanagiryuji
1
2.2k
Featured
See All Featured
3n
163
22k
samlambert
237
9.9k
tanoku
86
8.5k
kneath
219
15k
shpigford
165
19k
wjessup
338
16k
robhawkes
52
2.8k
sachag
446
36k
yeseniaperezcruz
302
31k
morganepeng
92
14k
philnash
8
510
akmur
252
19k
Transcript
Python Docx A bullshit-free way to create, edit, and read
MS Word documents @mikemaccana github.com/mikemaccana/python-docx Friday, 19 February 2010
Previous Approaches Approach Using Shitty Old Technology Calling Java Apache
POI Java Automating OpenOffice PyUNO OpenOffice Automating MS Office COM COM MS OpenXML SDK IronPython Windows Friday, 19 February 2010
Making Documents Paragraphs Bullets/Numbering Headings Tables Images Section and Page
Breaks Stuff for screen readers Document Properties Friday, 19 February 2010
Reading Documents Extract plain text of document Originally I intended
this for Python programmers Has since found popularity with ANGRY UNIX GUYS. And document properties too. Friday, 19 February 2010
Manipulating Existing Docs Add and delete items anywhere within the
document Search + replace Friday, 19 February 2010
Serving Suggestions Python LinkedIn API - auto-build a Word doc
whenever some old recruiting dude asks for one. Lamson - transparently fuck with people’s email attachments. NLTK - pass content to NLTK for logical inference, identifying people / org relationships, etc. Friday, 19 February 2010
Future I’m going to fucking have to do styles at
some point. Fuckity fuck fuck fuck. Friday, 19 February 2010
Join us! <1000 lines - lxml does all the heavy
lifting nose (currently 100% coverage) Example code for everything MIT licensed Friday, 19 February 2010
github.com/mikemaccana/python-docx @mikemaccana Thankyou Friday, 19 February 2010