Difference between revisions of "Treating the Traité"
From Mondothèque
(Created page with "==Sources== Scans OCR ==Transcribing the Traité== Wikisource https://github.com/PaulOtlet/traite http://traite.czam.de/en/latest/otlet_traite_1934_FR.html#i-buts-de-la-docu...") |
Dickreckard (talk | contribs) |
||
(23 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
− | = | + | <div class="intro">Experiments with digital iterations of The Book on the Book.</div> |
− | |||
− | |||
− | ==Transcribing the Traité== | + | == Le livre sur le livre == |
− | Wikisource | + | |
+ | Developers, designers, artists, theoreticians, writers, archivists and copyleft-activists are welcome to join a 2 day booksprint/hackathon based on Paul Otlet's 'Le Traité de documentation: Le livre sur le livre' which entered the Public Domain in 2015. | ||
+ | |||
+ | This dense publication combines the genres of manual, encyclopedia, pamphlet and science-fiction to include many of Paul Otlet's visions on the practice of documentation and the future of books. Lemma's on the current state of censorship, the history of the alphabet or "inventions to be made" alternate with precise descriptions of how to reference a book on an index card, or what would be the ideal working conditions for a documentalist. | ||
+ | |||
+ | Drawing on the work done on wikisource we will experiment with form, materiality and content of the 'Traité de documentation' to create a digital re-edition of The Book on The Book. | ||
+ | |||
+ | Organised by Constant (Mondotheque) in collaboration with Arts2 and The Mundaneum archive center. | ||
+ | |||
+ | [[Le_livre_sur_le_livre|More details (in French)]] | ||
+ | |||
+ | == 1989 Facsimile == | ||
+ | |||
+ | <gallery> | ||
+ | File:IMG 3188.JPG | ||
+ | File:IMG_3192.JPG | ||
+ | File:IMG_3186.JPG | ||
+ | File:TDD_ed1989_theBook.jpg | ||
+ | </gallery> | ||
+ | |||
+ | From the [[place::Public Library of Schaerbeek]], [[Yves Bernard]] borrowed this rare 1989 edition, published by the CLPCF (the association which inherited the archives from Les Amis du Palais Mondial/Mundaneum). | ||
+ | |||
+ | Introductions by [[Robert Estivals]] and [[André Canonne]]: [[file:TDD_ed1989_preface.pdf]] | ||
+ | |||
+ | == Scan Tailor == | ||
+ | |||
+ | [[Tomislav Medak]] spends two days with us at [[place::Akademie Schloss Solitude]] to demonstrate a workflow for digitizing books. I use the opportunity to look at the Traité through the lens of Scan Tailor, "an interactive post-processing tool for scanned pages"<ref>http://scantailor.org/</ref>. | ||
+ | |||
+ | I import the image files exported from the [http://lib.ugent.be/fulltxt/handle/1854/5612/Traite_de_documentation_ocr.pdf pdf] into Scan Tailor and let it treat the Traité with all options set to 'automatic'. It produces exciting artefacts: | ||
+ | |||
+ | {{#ask: [[technology::Scan Tailor]] | format=gallery}} | ||
+ | |||
+ | == Printing the Traité == | ||
+ | |||
+ | The <em>Traité de documentation : le livre sur le livre, théorie et pratique</em> is an almost hypertextual book on documentation, written in the 1930's by Paul Otlet. It has many cross-references, tables and illustrations; at times it is written in encyclopedic style, turns into a passionate manifesto, speculative fiction, and a practical manual for librarians. The pdf I have is badly OCR-ed and too heavy for reading comfortably on a digital device. So this morning I transformed the digital version into something that I can print at a copy shop. | ||
+ | |||
+ | I started with extracting the images from the pdf with the help of the imagemagick convert command: | ||
+ | |||
+ | <code>$ mkdir spreads</code> | ||
+ | |||
+ | <code>$ convert Traite\ de\ documentation\ -\ Paul\ Otlet.pdf spreads/%03d.jpg</code> | ||
+ | |||
+ | <!--more-->Next I removed front- and back-cover (they will be treated separately), and also <code>113.jpg</code> (pages 118-119 are repeated), then cut each spread in half: | ||
+ | |||
+ | <code>mkdir pages</code> | ||
+ | |||
+ | <code>convert spreads/*.jpg -crop 2x1@ pages/%03d.jpg</code> | ||
+ | |||
+ | The properties of the original pdf mention a paper size of 200 × 260 mm (and also that the file was created with <code>ABBYY FineReader</code> on <code>Monday December 3, 2007 16:25:51 CET</code> (This file is already 6 years old ...). I am not sure if the measurements refer to the size of the spread or the single page, but from the detailed description in the catalog of the Universiteitsbibliotheek Gent <ref>http://lib.ugent.be/catalog/rug01:000990276#reference-details</ref> I gather that pages are 26cm high, and will fit comfortably on an A4: <code>431, [12], viii p. : illus. ; 26 cm.</code> | ||
+ | |||
+ | I then simply put all images back into a new pdf: | ||
+ | |||
+ | <code>convert pages/*jpg traite.pdf</code> | ||
+ | |||
+ | Tomorrow I'll have the document printed and bound. Can't wait. | ||
+ | |||
+ | == Transcribing the Traité == | ||
+ | in progress on [http://fr.wikisource.org/wiki/Livre:Otlet_-_Trait%C3%A9_de_documentation,_1934.djvu Wikisource] | ||
https://github.com/PaulOtlet/traite | https://github.com/PaulOtlet/traite | ||
http://traite.czam.de/en/latest/otlet_traite_1934_FR.html#i-buts-de-la-documentation | http://traite.czam.de/en/latest/otlet_traite_1934_FR.html#i-buts-de-la-documentation | ||
+ | |||
+ | == Sources == | ||
+ | Original scans http://lib.ugent.be/fulltxt/handle/1854/5612/Traite_de_documentation_ocr.pdf | ||
+ | OCR https://archive.org/details/OtletTraitDocumentationUgent | ||
+ | |||
+ | == Index == | ||
+ | L'[[index Traité de documentation]] permet d'indexer n'importe quel extrait issu du Traité de documentation<ref>https://fr.wikisource.org/wiki/Livre:Otlet_-_Trait%C3%A9_de_documentation,_1934.djvu</ref>. cité ou référencé sur Mondothèque. Il constitue un nouvel index collaboratif du texte d'Otlet, numérique et mouvant. |
Latest revision as of 14:40, 25 June 2016
Contents
Le livre sur le livre
Developers, designers, artists, theoreticians, writers, archivists and copyleft-activists are welcome to join a 2 day booksprint/hackathon based on Paul Otlet's 'Le Traité de documentation: Le livre sur le livre' which entered the Public Domain in 2015.
This dense publication combines the genres of manual, encyclopedia, pamphlet and science-fiction to include many of Paul Otlet's visions on the practice of documentation and the future of books. Lemma's on the current state of censorship, the history of the alphabet or "inventions to be made" alternate with precise descriptions of how to reference a book on an index card, or what would be the ideal working conditions for a documentalist.
Drawing on the work done on wikisource we will experiment with form, materiality and content of the 'Traité de documentation' to create a digital re-edition of The Book on The Book.
Organised by Constant (Mondotheque) in collaboration with Arts2 and The Mundaneum archive center.
1989 Facsimile
From the Public Library of Schaerbeek, Yves Bernard borrowed this rare 1989 edition, published by the CLPCF (the association which inherited the archives from Les Amis du Palais Mondial/Mundaneum).
Introductions by Robert Estivals and André Canonne: File:TDD ed1989 preface.pdf
Scan Tailor
Tomislav Medak spends two days with us at Akademie Schloss Solitude to demonstrate a workflow for digitizing books. I use the opportunity to look at the Traité through the lens of Scan Tailor, "an interactive post-processing tool for scanned pages"[1].
I import the image files exported from the pdf into Scan Tailor and let it treat the Traité with all options set to 'automatic'. It produces exciting artefacts:
Printing the Traité
The Traité de documentation : le livre sur le livre, théorie et pratique is an almost hypertextual book on documentation, written in the 1930's by Paul Otlet. It has many cross-references, tables and illustrations; at times it is written in encyclopedic style, turns into a passionate manifesto, speculative fiction, and a practical manual for librarians. The pdf I have is badly OCR-ed and too heavy for reading comfortably on a digital device. So this morning I transformed the digital version into something that I can print at a copy shop.
I started with extracting the images from the pdf with the help of the imagemagick convert command:
$ mkdir spreads
$ convert Traite\ de\ documentation\ -\ Paul\ Otlet.pdf spreads/%03d.jpg
Next I removed front- and back-cover (they will be treated separately), and also 113.jpg
(pages 118-119 are repeated), then cut each spread in half:
mkdir pages
convert spreads/*.jpg -crop 2x1@ pages/%03d.jpg
The properties of the original pdf mention a paper size of 200 × 260 mm (and also that the file was created with ABBYY FineReader
on Monday December 3, 2007 16:25:51 CET
(This file is already 6 years old ...). I am not sure if the measurements refer to the size of the spread or the single page, but from the detailed description in the catalog of the Universiteitsbibliotheek Gent [2] I gather that pages are 26cm high, and will fit comfortably on an A4: 431, [12], viii p. : illus. ; 26 cm.
I then simply put all images back into a new pdf:
convert pages/*jpg traite.pdf
Tomorrow I'll have the document printed and bound. Can't wait.
Transcribing the Traité
in progress on Wikisource
https://github.com/PaulOtlet/traite http://traite.czam.de/en/latest/otlet_traite_1934_FR.html#i-buts-de-la-documentation
Sources
Original scans http://lib.ugent.be/fulltxt/handle/1854/5612/Traite_de_documentation_ocr.pdf OCR https://archive.org/details/OtletTraitDocumentationUgent