2. Beta 1 released at SSP
http://blogs.msdn.com/exscientia/archive/2008/
05/30/beta-of-microsoft-s-article-authoring-add-
in-now-available-broadly-for-download.aspx
If you have any questions or feedback, feel
free to contact me through the blog
We can also connect on Facebook and you
can follow me on Twitter (pablofe)
3. Content, Metadata, and XML
Word 2007
NLM and Docx Formats
Author Experience
Behind the Scenes Look
Editor Experience
What’s Next?
Downloads
4. At the beginning of the transition to digital
consumption
Fully realizing benefits from transition to
digital content will depend on structured
data
Improved search results
Semantic analysis based recommendations
and relationships
5. Connected lifestyle and consumption of
digital content
Technology enablers
Access to computers and connectivity
XML formats
Mainstream use of XML in authoring
6. Provide an experience that assists and incentives
authors to enter a core set of metadata and
content semantics
Capture author intent and domain expertise
Authors need not become aware of the underlying
format
Re-use author metadata to simplify submission
experience
Preserve data through publishing and archival
7. Free add-in for Word 2007
Open and save documents to the NLM format
Templates for structured content
Metadata editing and validation
Support for NLM Book format
Integration with Design Science’s MathType control
Releasing Beta 1 this week
We welcome your feedback, requirements, and
scenarios
http://blogs.msdn.com/exscientia/
9. Full access to Metadata editing within Word
Access to article information within docx
files
Validation against DTD and style guide
Connect content to data or additional
material
10. Journals and conferences are still two of
the main conduits for new information
As content became available electronically,
search provided an additional way of
locating new content
New ways of discovering content may
come from semantic analysis,
recommendations, and social connections
11. Search and automated recommendations
depend on structured information, in
addition to full text content
XML
Basis for structured content
Part of mainstream workflows
Now part of authoring
Metadata provides one of the means for
deriving insights into the content and its
connections
14. Separation of presentation from semantics
Structured content:
Beneficial for search
Beneficial for long term archival
Can even be used to derive presentation
15. Enable authors to express information as
part of the writing process
Metadata
Author and co-author information
Keywords and taxonomy
Content
Sections, statements, and other semantic
information
16. Brings together fundamental advances
XML format
Packaging
Content extensibility
User Interface extensibility
Article Authoring Add-in
Builds on native Word UI as much as possible
Lower the learning curve
Simplifies authoring process
Templates, validation, metadata entry
17. Four NLM formats
Journal Publishing tag set (Blue)
Book tag set (Purple)
NLM formats
Extensive metadata
Light on presentation
Docx/OpenXML
Extensive support for presentation and editing
Light on metadata
18. Introduce as few new concepts as possible
Not directly aware of XML or NLM format
Simplicity in usability over features, avoid
taxing the author
Experience driven by templates and
centered around:
Required sections
Metadata (keywords, subject, authors)
20. A docx file is a zip file
Document parts
Images
Custom XML
Metadata is stored within docx files as XML using
the NLM format
Keeps the content and metadata together in a single
file
Word content is augmented with custom XML
elements where needed
21. A second audience for the add-in
Editors, journal and archive staff
Likely to have some specific knowledge of
the NLM format
May want to access all of the metadata
But without having to resort to a XML editor
22. To be created by journals
Subset of a NLM article
Custom properties to indicate
Required and optional sections
Minimum and maximum length for sections
Minimum number of keywords or subjects
Ability for authors to add custom sections or
keywords
23. Author related information
Name, affiliation, email, biography, etc
Simplify entry
Automate author information
Co-authors can be grouped for re-use
Reduce data entry errors
24. Enable other add-ins to be created and to
work alongside the Authoring Add-in
Provide a simple way for content to be
extended or annotated
Enable institutions to use their own
metadata form
25. How best to extend content and have it
preserved in the generated NLM XML file?
Named-Content allows for tagging of terms
Other add-ins can rely on Authoring Add-in and
use this element to annotate the document
content
26. Present a form for authors to fill in
Enable custom workflows within
institutions
Data entry in a structured way and not as
part of the content
Custom Document Information Panel
or InfoPath form
27. Existing or new tools can access the different
parts of a docx file
Validate or convert content
Extract or add metadata
Connect metadata elements to a database
Tools can run on any platform, and on client or
server
Just need a zip library and XML tools
28. Beta 1 available
Download and try it
Provide feedback
Investigating Mac Word 2008 capabilities
What additional functionality would you like
to see in the add-in?
29. Add-in Beta 1
Download here
Project blog - http://blogs.msdn.com/exscientia/
Word File Format Compatibility
Open and save docx files from Word 2003
http://www.microsoft.com/downloads/details.aspx?
familyid=941b3470-3ae9-4aee-8f43-
c6bb74cd1466&displaylang=en
Save as PDF add-in for Word 2007
http://www.microsoft.com/downloads/details.aspx?
FamilyID=4d951911-3e7e-4ae6-b059-
a2e79ed87041&DisplayLang=en
Contact info:
Pablo.Fernicola@microsoft.com
Pablofe on Twitter, also on Facebook