<ul> <li>OCR files must be in a text format with a .txt extension. When using ABBYY Hotfolders, use the "Create a separate document for each file" and "Save as: Text (*.txt)" settings.</li> <li>Files must be named using the Catalog Number</li> <li>Since OCR text is linked directly to image, source image must have been previously loaded into database</li> <li>If there are more than one image linked to a specimen, the full file name will be used to identify which image to link the OCR</li> </ul> </div> <div style="margin:15px"> <table style="width:100%;"> <tr> <td> <b>Regular Expression:</b> </td> <td> <input name="speckeypattern" type="text" style="width:300px;" value="<?php echo $procManager->getSpecKeyPattern(); ?> " /> <a id="speckeypatterninfo" href="#" onclick="return false" title="More Information"> <img src="../../images/info.png" style="width:15px;" /> </a> <div id="speckeypatterninfodialog"> Regular expression (PHP version) needed to extract the unique identifier from source text. For example, regular expression /^(WIS-L-\d{7})\D*/ will extract catalog number WIS-L-0001234 from image file named WIS-L-0001234_a.jpg. For more information on creating regular expressions, Google "Regular Expression PHP Tutorial". It is recommended to have the portal manager help with the initial setup of batch processing. </div> </td> </tr> <tr>
<div id="innertext"> <h2><?php echo $specManager->getCollectionName(); ?> </h2> <?php if ($isEditor) { $specManager->setProjVariables($spprid); if ($action == 'Process Images') { if ($specManager->getProjectType() == 'iplant') { $imageProcessor = new ImageProcessor(); echo '<ul>'; $imageProcessor->setLogMode(3); $imageProcessor->setCollid($collid); $imageProcessor->setSpprid($spprid); $imageProcessor->processIPlantImages($specManager->getSpecKeyPattern(), $specManager->getLastRunDate()); echo '</ul>'; } else { echo '<div style="padding:15px;">' . "\n"; $imageProcessor = new ImageBatchProcessor(); $imageProcessor->setLogMode(1); $imageProcessor->initProcessor(); $imageProcessor->setCollArr(array($collid => array('pmterm' => $specManager->getSpecKeyPattern()))); $imageProcessor->setDbMetadata(1); $imageProcessor->setSourcePathBase($specManager->getSourcePath()); $imageProcessor->setTargetPathBase($specManager->getTargetPath()); $imageProcessor->setImgUrlBase($specManager->getImgUrlBase()); $imageProcessor->setServerRoot($serverRoot); if ($specManager->getWebPixWidth()) { $imageProcessor->setWebPixWidth($specManager->getWebPixWidth()); }
<input name="title" type="text" style="width:300px;" value="<?php echo $specManager->getTitle(); ?> " /> </div> </div> <?php } ?> <div style="clear:both;"> <div style="width:180px;float:left;"> <b>Pattern match term:</b> </div> <div style="float:left;"> <input name="speckeypattern" type="text" style="width:300px;" value="<?php echo $specManager->getSpecKeyPattern(); ?> " /> <a id="speckeypatterninfo" href="#" onclick="return false" title="More Information"> <img src="../../images/info.png" style="width:15px;" /> </a> <div id="speckeypatterninfodialog"> Regular expression needed to extract the unique identifier from source text. For example, regular expression /^(WIS-L-\d{7})\D*/ will extract catalog number WIS-L-0001234 from image file named WIS-L-0001234_a.jpg. For more information on creating regular expressions, Google "Regular Expression PHP Tutorial" </div> </div> </div> <?php if ($projectType != 'idigbio' && $projectType != 'iplant') {