Skip to content

pulibrary/Scripto

 
 

Repository files navigation

Scripto (plugin for Omeka)

Instructions for loading Figgy records

  1. Find your item in Figgy
  2. Click on it to view the show page, then copy the figgy_id from the URL (ID is in bold: https://figgy.princeton.edu/catalog/*4b73d943-b005-4c02-ae96-19060ad8b43c*)
  3. Clone this repo to your local drive and cd into it
  4. Run ruby ./figgy_to_csv.rb {figgy_id}
  5. Log into http://transcribe.princeton.edu/admin
  6. Click CSV import, select the item file to upload, and map the fields appropriately. (Make sure to check the Files? column for the last row labeled files.)
  7. Click CSV import again, select the files file to upload, and map the fields appropriately. (Make sure to check the filename? column for the first row labeled filename.)
  8. Make note of the item id in Omeka (part of the public url)
  9. Log onto the server ssh pulsys@libphp-prod.princeton.edu
  10. sudo su - drupal
  11. cd /var/www/apps/transcribe/plugins/Scripto
  12. Add image numbers to the thumbnail pages: php update_order.php {item_id} 96 (note: that second argument is the element_id for Weight)
  13. If you want to double check the element_id, Log into the mysql database, use the transcribe_omeka db, and run select * from elements where name = "Weight"

Summary

Scripto is an open-source tool developed by Center for History & New Media that allows collaborative transcription of digital files. Built on MediaWiki, Scripto is intended to be used as an extension of a content management system. Scripto is currently available as a plugin for Omeka, Wordpress, and Drupal.

DIYHistory|transcribe uses Scripto with Omeka. Visit DIYhistory|code for an overview of the technology stack and implementation information.

Features of the Scripto plugin for Omeka can be found at Github|Scripto. The UI Libraries fork of plugin-Scripto adds the following features:

  • Track completion status of document pages (i.e., ‘Not Started’, ‘Needs Review’, ‘Completed’).
  • Track completion progress of documents based on page statuses.
  • Sort documents within their collection by most completed, floating least completed to the top.
  • Initialize document page text entry box with pre-existing text, if available (helpful if using Scripto to correct OCR for typescript pages).
  • Fill document pages with a pre-existing text, if available in a field element of each file, via a button in the admin item pages (helpful to re-initialize a list of document pages at any time).
  • The source element used to initialize or to reinitialize a document page can be chosen (helpful to keep track of original source). Scripto:Transcription is used by default. It should be a file level element.
  • On every page action, automatically import transcriptions from MediaWiki as file metadata.
  • Possibility to define items that should not be transcribed via the admin page items/show.
  • Choice of the order of the pages inside a document by predefined order, original filename or file id.
  • Choice of the derivative image to display (default is the original one, but fullsize or any other derivative can be used).

The UI Libraries has also created an Omeka theme, Scribe, to make use of these new features.

To accommodate these features, UI Libraries has added four elements to the Scripto metadata element set: “Status”, “Percent Needs Review”, “Percent Completed” and “Weight”. These values are calculated and updated with every “Save”, “Approve”, or “Unapprove” action made on a document page.

“Status” stores the completion status of a document page (i.e., “Not Started”, “Needs Review”, “Completed”) in the page-level metadata. Scribe displays this value on each page thumbnail in the items/show page.

In the document (item) level metadata, “Status” stores the status of a document. Currently, this is used only for the status “Not to transcribe”. Values for “Percent Needs Review” and “Percent Completed” represent the percentage of pages within the document carrying the “Needs Review” and “Completed” statuses, respectively. These percentages are used by Scribe to display the stacked progress bars for each item in the collections/show page. The 6-digit sorting number stored in “Weight” is used by Scribe to sort items on collections/show page by completion progress, sinking the most completed items to the bottom and floating the least completed to the top.

Installation

Uncompress files and rename plugin folder "Scripto".

Then install it like any other Omeka plugin and follow the config instructions.

The new fields for the Scripto element set will automatically be installed with a fresh install of the UI Libraries fork of plugin-Scripto or with an upgrade of the upstream version.

Metadata

Metadata and content files are uploaded to Omeka using the fork of the plugin CsvImport. CSV Import is an Omeka plugin for batch uploading metadata and content file to Omeka. The UI Libraries fork of this plugin allows for the batch uploading of page-level (file) descriptive metadata. To upload using the UI Libraries’ version, first upload the csv file of items as instructed in the CsvImport documentation (choosing ‘Record Type: Item’ in the Csv Import interface). After uploading and creating the items, upload the csv file of page-level metadata (choosing ‘Record Type: File’ in the CsvImport interface). Map the Original filename to ‘Filename?’ in Step 2 of the CSV Import interface. See _sample-data Item.csv and _sample-data File.csv for an example. If you are pulling content from your version 6 CONTENTdm installation, you may find this sample gist helpful in generating csv upload files.

Document-level (item) metadata

Element

Omeka element

Comments

Title

dc:Title

The title of the document

Source URL

dc:Source

(optional) The URL for the original location or master record for the document (If you are replicating the document from another digital environment).

Source identifier

dc:Identifier

(optional) An identifier for the document that ties it to the original location or master record (If you are replicating the document from another digital environment). Note that Omeka will generate a system identifier for each document (item).

Digital collection URL

dc:Is Part Of

(optional) The URL for the digital collection that the document belongs to in its original digital environment (If you are replicating the document from another digital environment).

Finding aid URL

dc:Relation

(optional) The URL for the finding aid of the document’s source collection.

Sorting number

Scripto:Weight

6-digit number for sorting the item within its collection in the collections/show display. Set to ‘000000’ as default. This will get updated every time a document page from the document is saved, approved, or unapproved.

Percent needs review

Scripto:Percent Needs Review

Percentage of pages with status ‘Needs Review’. No default needed. This will get updated every time a document page from the document is saved, approved, or unapproved.

Percent completed

Scripto:Percent Completed

Percentage of pages with status ‘Completed’. This will get updated every time a document page from the document is saved, approved, or unapproved.

Page-level (file) metadata

Element

Omeka element

Comments

Original filename

The file location specified in the csv item upload file. The UI Libraries fork of plugin-CsvImport uses this filename to find the Omeka file record for applying the page-level metadata.

Page label

dc:Title

The label for the page

Page-level source URL

dc:Source

(optional) The URL for the original location or master record for the document page (If you are replicating the document from another digital environment).

Source identifier

dc:identifier

(optional) An identifier for the document page that ties it to the original location or master record (If you are replicating the document from another digital environment). Note that Omeka will generate a system identifier for each document page (file).

Transcription

Scripto:Transcription

The transcription for the document page. This will get updated every time a document page is saved or approved. You may pre-populate this field on ingest to Omeka with OCR or existing transcription, if desired.

Status

Scripto:Status

Completion status of the document page (‘Not Started’, ‘Needs Review’, ‘Completed’). Set to ‘Not Started’ as default. This will get updated every time a document page is saved, approved, or unapproved.

Omeka file order

no map

The order of the page within the document. When used with the ui-libraries/plugin-Csv-Import fork, this value will assure files are in the correct sequence.

Warning

Use it at your own risk.

It's always recommended to backup your files and database so you can roll back if needed.

Troubleshooting

See online issues on the Scripto issues page on GitHub.

License

This plugin is published under GNU/GPL.

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.

Contact

Current maintainers:

First version of this plugin has been built by Center for History & New Media. Next, UI libraries has forked it in order to add some features. This fork has been merged into the main version for Omeka 2 for École des Mines ParisTech and completed.

Copyright

  • Copyright Center for History and New Media, 2008-2013
  • Copyright Shawn Averkamp, 2012-2013
  • Copyright Daniel Berthereau, 2013-2014

About

Plugin for Omeka that adds the ability to transcribe items using the Scripto library.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • PHP 94.7%
  • Ruby 3.8%
  • Other 1.5%