{"id":8418,"date":"2012-05-25T09:00:03","date_gmt":"2012-05-25T13:00:03","guid":{"rendered":"http:\/\/mith.umd.edu\/?p=8418"},"modified":"2020-10-08T16:02:14","modified_gmt":"2020-10-08T20:02:14","slug":"almost-ready-for-prime-time","status":"publish","type":"post","link":"https:\/\/mith.umd.edu\/almost-ready-for-prime-time\/","title":{"rendered":"Almost Ready for Prime Time"},"content":{"rendered":"<p>We now have two versions of a demos up and ready to run. Both allow a user to pull data from the witness files, containing manuscript transcriptions, select texts to compare, run the texts through a version of CollateX, then present the results as an alignment table (a \u201csynopsis\u201d in or \u201cpartitur\u201d in some text-critical dialects), and as a text with apparatus.<\/p>\n<p>The second of these is still buggy (and the cause of both a couple of late nights night and the lateness of this post (for which I apologize heartily to the nice people at MITH)), but it does a couple of additional things:<\/p>\n<ul>\n<li>Prioritization. While the ability to generate all sorts of different apparatus is a desideratum, at present what we can do is choose the order in which results are presented, and, in the case of presenting a text with apparatus, the first text chosen becomes the base text for comparison.<\/li>\n<li>Tokenizing. I am now able to tokenize in two steps. First with \u201crich\u201d tokens that retain data about the individual words (e.g., abbreviations, which should be compared based on their expanded text rather than on the abbreviation as written), as well as other data in the text (page breaks, etc). From there we can create \u201cregularized\u201d tokens. For now I have regularized the tokens by removing all yods and waws. Additional candidates might include dealing with prepositions that are sometimes but not always attached in medieval Mishnah manuscripts (shel, e.g.), final aleph\/heh, and final nun\/mem. \u201cSimple\u201d tokens are passed to Collatex (or, we allow Collatex to process \u201crich\u201d tokens) and the resulting collation output is merged with the rich tokens.<\/li>\n<li>Presentation. Because the \u201crich\u201d tokens retain information about the witness, it is possible to generate a \u201ctext-with-apparatus\u201d in which the base text can be presented with formatting and contextual information that may be useful to the reader. (Disclaimer: Here is a big bug: The XSLT that joins the two lists of tokens inserts the non-words (page breaks etc.) in a position that is offset by one location. Any suggestions?)<\/li>\n<\/ul>\n<p>Next up: modifying the demo to present multi-column synopses, and linking in Talmudic and Commentary citations.<\/p>\n<p><em>Hayim Lapin is Robert H. Smith Professor of Jewish Studies and Professor in the Department of History at the University of Maryland. He currently is completing a faculty fellowship at MITH. This post originally appeared at<a href=\"http:\/\/www.digitalmishnah.org\/uncategorized\/housekeeping\/\"> Digital Mishnah<\/a> on May 24th, 2012.<\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>We now have two versions of a demos up and ready to run. Both allow a user to pull data from the witness files, containing [&hellip;]<\/p>\n","protected":false},"author":16,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[70,71],"tags":[110],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v15.0 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Almost Ready for Prime Time &ndash; Maryland Institute for Technology in the Humanities<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/mith.umd.edu\/almost-ready-for-prime-time\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Almost Ready for Prime Time &ndash; Maryland Institute for Technology in the Humanities\" \/>\n<meta property=\"og:description\" content=\"We now have two versions of a demos up and ready to run. Both allow a user to pull data from the witness files, containing [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/mith.umd.edu\/almost-ready-for-prime-time\/\" \/>\n<meta property=\"og:site_name\" content=\"Maryland Institute for Technology in the Humanities\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/UMD.MITH\" \/>\n<meta property=\"article:published_time\" content=\"2012-05-25T13:00:03+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2020-10-08T20:02:14+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/mith.umd.edu\/wp-content\/uploads\/2018\/10\/MITH-logostack-square-grn.png\" \/>\n\t<meta property=\"og:image:width\" content=\"300\" \/>\n\t<meta property=\"og:image:height\" content=\"300\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebSite\",\"@id\":\"https:\/\/mith.umd.edu\/#website\",\"url\":\"https:\/\/mith.umd.edu\/\",\"name\":\"Maryland Institute for Technology in the Humanities\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":\"https:\/\/mith.umd.edu\/?s={search_term_string}\",\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/mith.umd.edu\/almost-ready-for-prime-time\/#webpage\",\"url\":\"https:\/\/mith.umd.edu\/almost-ready-for-prime-time\/\",\"name\":\"Almost Ready for Prime Time &ndash; Maryland Institute for Technology in the Humanities\",\"isPartOf\":{\"@id\":\"https:\/\/mith.umd.edu\/#website\"},\"datePublished\":\"2012-05-25T13:00:03+00:00\",\"dateModified\":\"2020-10-08T20:02:14+00:00\",\"author\":{\"@id\":\"https:\/\/mith.umd.edu\/#\/schema\/person\/4a8c89403fcc21b1d79a56e4c5acbbb2\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/mith.umd.edu\/almost-ready-for-prime-time\/\"]}]},{\"@type\":\"Person\",\"@id\":\"https:\/\/mith.umd.edu\/#\/schema\/person\/4a8c89403fcc21b1d79a56e4c5acbbb2\",\"name\":\"Hayim Lapin\",\"image\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/mith.umd.edu\/#personlogo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/c0363e65b869dae8f81f321c80ce9322?s=96&d=mm&r=g\",\"caption\":\"Hayim Lapin\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","_links":{"self":[{"href":"https:\/\/mith.umd.edu\/wp-json\/wp\/v2\/posts\/8418"}],"collection":[{"href":"https:\/\/mith.umd.edu\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mith.umd.edu\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mith.umd.edu\/wp-json\/wp\/v2\/users\/16"}],"replies":[{"embeddable":true,"href":"https:\/\/mith.umd.edu\/wp-json\/wp\/v2\/comments?post=8418"}],"version-history":[{"count":1,"href":"https:\/\/mith.umd.edu\/wp-json\/wp\/v2\/posts\/8418\/revisions"}],"predecessor-version":[{"id":21200,"href":"https:\/\/mith.umd.edu\/wp-json\/wp\/v2\/posts\/8418\/revisions\/21200"}],"wp:attachment":[{"href":"https:\/\/mith.umd.edu\/wp-json\/wp\/v2\/media?parent=8418"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mith.umd.edu\/wp-json\/wp\/v2\/categories?post=8418"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mith.umd.edu\/wp-json\/wp\/v2\/tags?post=8418"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}