|
|
NEWSLETTER | JANUARY 2023 STRUCTURING DATA AND CONTENT SINCE 1981
|
|
|
|
|
|
|
Unlocking the Record of American
Creativity With Data Extraction and Content Structure
|
DCL and The New York Public Library (NYPL) will speak at Digital Book World in New York City on January 17. Greg Cram, Associate General Counsel and Director, Information Policy at NYPL and Mark Gross, President at DCL, will present a case study that details how NYPL with the support of DCL has digitized historical records of the US Copyright Office, making those records searchable, accessible, and useful for new product development and more. "Unlocking the Record of American Creativity With Data Extraction and Content Structure" is an ambitious and generous initiative by the NYPL to make books available anywhere at any time for its patrons.
|
|
|
|
|
Document Formats that Leo Hates
|
Leo Belchikov is a Senior Project Manager at Data Conversion Laboratory and has spearheaded many conversion projects over his 25 years at DCL. Internally we often say “Leo is going to hate this” when we encounter a format that is not up to par with his requirements of content portability, accessibility, and flexibility. We asked Leo to share some of his observations about the typical formats he encounters when converting from one format into XML.
|
|
|
|
|
Detecting AI-Created Text With GPTZero
|
A college student—Edward Tian, a 22-year-old senior at Princeton University—recently released an app to detect whether text was written by ChatGPT (oh, and he did this over his winter break!). As organizations and industries are increasingly aware of AI plagiarism and logarithmic narratives, humans want to discern what is written by a human and what is composed by an AI. GPTZero is a free app that allows users to paste text into the application and analyze using two key indicators: "perplexity" and "burstiness." The complexity of text and whether or not the bot is familiar with the text comprises the perplexity measure. Burstiness involves sentence variations. Humans typically combine long and short sentences in writing, while AI sentences are more uniform.
|
|
|
|
|
2023 Publishing Tech Trends Report from Silverchair and Hum
|
Our friends at Silverchair and Hum published the second annual report on publishing technology trends. Hum’s President & Co-Founder, Dustin Smith, looks at five trends likely to shape 2023. Then, a panel of 12 publishing executives and consultants weigh in on their predictions for 2023 and beyond. Download your copy to read the thoughtful considerations around revenue growth, the Nelson memo, digital transformation, and more.
|
|
|
|
|
IT Projections for the Healthcare Industry
|
Healthcare organizations have faced an incredible amount of challenges over the past two years as all aspects of the industry navigated a global pandemic during a time of great technological advances. Healthcare IT News asked some of the leading IT tech company CEOs what they think are critical issues for healthcare tech challenges in the coming year. Projected challenges and solutions include issues related to new interoperability requirements, increasing regulatory requirements, and fast-evolving data modeling hurdles are some of the topics discussed by the leading health IT vendors.
|
|
|
|
|
|
CONFERENCE Digital Book World | 16 to 18 January | New York City
DCL's Mark Gross and The New York Public Library's Greg Cram are speaking Digital Book World, the annual gathering of the publishing industry, returns to New York City after six years. The annual publishing and technology gathering that started over a decade ago in New York City heads home.
[REGISTER NOW]
|
|
|
WEBINAR Tables are Tough! Perfecting an AI Model to Automate Table-to-XML Extraction | 1 February Transforming tabular content into a structured model such as XML or HTML is nearly always a manual or semi-manual process. Data Conversion Laboratory and FuseMachines created an AI model that finds and extracts information from all tables in a document using a combination of Computer Vision (CV) and Natural Language Processing (NLP). We’ll review how we developed and managed a hybrid approach of rules-based processes and machine learning to identify and extract tabular data, and augmented training data to develop an AI model that automates table-to-XML extraction.
[REGISTER NOW]
|
|
|
ONLINE CONFERENCE
NISO Plus Conference 2023 | 14 to 16 February NISO Plus online brings people together from across the global information community to share updates and participate in conversations about shared challenges and opportunities. The focus is on identifying next steps to improve information flow and help solve existing and potential future problems.
[REGISTER NOW]
|
|
|
DCL partners with many global organizations that complement our services and offer a complete workflow solution to our customers. Following are some recent highlights from DCL's Partnership Laboratory.
|
|
The Five Dimensions of Content Standardization™ for Pharma: Making Your Automation and Reuse Strategy a Success
In this white paper, Content Rules describe what content standards are and why they are important. You will learn what the Five Dimensions of Content Standardization™ are and how their content standardization framework makes your content FAIR (findable, accessible, interoperable, and reusable). When you clearly document and enforce standards across all Five Dimensions, you are rewarded with content that can be reused seamlessly. You also reduce risk, time-to-market, and cost, while increasing content quality in all languages.
|
|
|
|
|
Tridion by RWS Kicks Off 2023 With its Own Dedicated LinkedIn Page Visit Tridion on LinkedIn, where they will be talking about the latest industry news, exciting announcements, and all interesting happenings around such topics as structured authoring, technical publications, content management systems, and DITA. RWS's content management platform can deal with structured content (Tridion Docs) as well as web content/digital experiences (Tridion Sites), joined up by personalized, semantically enriched omnichannel content delivery (Tridion Dynamic Experience Delivery).
[VISIT LINKEDIN PAGE]
|
|
|
|
|
What’s the Difference between Anonymization and Redaction of Clinical Trial Data?
When it comes to disclosing data from clinical trials of investigational drugs to the public, the vernacular may seem just as confusing as the process! The terms “anonymization” and “redaction” are used interchangeably regarding transparency and disclosure of clinical trial data. But what are the differences in technique or method behind each? We may think of “anonymization” as assigning a patient a new subject ID, and “redaction” as black-boxing information we don’t want to disclose. But when do we use one approach over the other?
[READ MORE]
|
|
|
|
|
|
|
|
|