Category:Polyglotta:Documentation:SearchHelp

Search Documentation for Bibliotheca Polyglotta

One can search for any word or phrase in the whole corpus, in a specific library, or in a chosen set of texts. The search results are listed, and any result can be opened in its sentence-by-sentence multilingual view by clicking "Go to record".

By default, a search is performed within one's present location in the Bibliotheca Polyglotta (BP). The four search modes are:

  • Search for exact phrase (searches for every instance of the exact fragment, word or phrase in a record);
  • Search for exact phrase with regular expressions (same as above, with regex; see description below);
  • Search for every word in one record (searches for every instance of whole words, either a single word or two or more different words occurring separately within one multilingual record);
  • Search for every word fragment in one record (searches for every instance of a word fragment, either a single fragment or two or more different fragments occurring separately within one multilingual record).
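
The difference between the four modes can be illustrated with a minimal Python sketch. This is not the BP's implementation: the function names and the sample record are hypothetical, matching is assumed to be case-sensitive, and the two "every word" modes are assumed to require that all query terms occur somewhere in the same record.

```python
import re

# Hypothetical helpers illustrating the four search modes on a single record.
# Assumptions: case-sensitive matching; query terms are separated by spaces.

def exact_phrase(record: str, phrase: str) -> bool:
    """Mode 1: the fragment, word or phrase occurs literally in the record."""
    return phrase in record

def exact_phrase_regex(record: str, pattern: str) -> bool:
    """Mode 2: as mode 1, but the query is read as a regular expression."""
    return re.search(pattern, record) is not None

def every_word(record: str, query: str) -> bool:
    """Mode 3: every query term occurs as a whole word, in any order."""
    return all(
        re.search(r"\b" + re.escape(word) + r"\b", record)
        for word in query.split()
    )

def every_fragment(record: str, query: str) -> bool:
    """Mode 4: every query term occurs as a substring, in any order."""
    return all(fragment in record for fragment in query.split())

# A made-up record; "|" stands in for the boundary between language versions.
record = "dharma is taught | the teaching of the dharmas"

print(exact_phrase(record, "is taught"))            # phrase in the given order
print(exact_phrase_regex(record, r"dharmas?$"))     # regex anchored at the end
print(every_word(record, "teaching dharma"))        # whole words, any order
print(every_fragment(record, "teach dharm"))        # fragments, any order
```

In this sketch "teach" is found by the fragment mode but not by the whole-word mode, because it occurs only as part of "teaching".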

The advanced search option "Choose specific texts..." may be used to limit one's search to specific libraries or specific texts.

For searching with regular expressions (regex), the BP employs Perl Compatible Regular Expressions (PCRE). The most important metacharacters are listed below; for a full description see http://perldoc.perl.org/perlre.html.

\     Quote the next metacharacter
^     Match the beginning of the line
.     Match any character (except newline)
$     Match the end of the line (or before newline at the end)
|     Alternation
()    Grouping
[]    Bracketed character class
*     Match 0 or more times
+     Match 1 or more times
?     Match 1 or 0 times
\l    Lowercase next char (think vi)
\u    Uppercase next char (think vi)
\L    Lowercase till \E (think vi)
\U    Uppercase till \E (think vi)
\w    Match a "word" character (alphanumeric plus "_", plus other connector punctuation chars plus Unicode marks)
\W    Match a non-"word" character
\s    Match a whitespace character
\S    Match a non-whitespace character
\d    Match a decimal digit character
\D    Match a non-digit character
\X    Match Unicode "eXtended grapheme cluster"
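
To get a feel for the most common metacharacters above, the following sketch tries a few patterns with Python's standard `re` module. Note the assumptions: Python's `re` is not PCRE, so this only approximates how the BP search behaves; the sample text is made up; and `\l`, `\u`, `\L`, `\U` and `\X` are left out because `re` does not support them in this sense.

```python
import re

# Made-up two-line sample text (not taken from the BP corpus).
text = "The cat sat.\nA dog ran 42 times."

# ^ with \w and +: the first word of every line (MULTILINE makes ^ match per line).
line_starts = re.findall(r"^\w+", text, flags=re.MULTILINE)

# \. quotes the dot; $ anchors it at the end of each line.
line_end_dots = re.findall(r"\.$", text, flags=re.MULTILINE)

# | alternation: either "cat" or "dog".
animals = re.findall(r"cat|dog", text)

# [] character classes: s or r, then a, then t or n.
class_hits = re.findall(r"[sr]a[tn]", text)

# \d+ : one or more decimal digits.
numbers = re.findall(r"\d+", text)

# ? makes the preceding character optional: "time" or "times".
optional_s = re.findall(r"times?", text)

# \S+, \s and \d+ combined: a non-space run, one space, then digits.
word_then_number = re.findall(r"\S+\s\d+", text)

for name, hits in [
    ("line_starts", line_starts),
    ("line_end_dots", line_end_dots),
    ("animals", animals),
    ("class_hits", class_hits),
    ("numbers", numbers),
    ("optional_s", optional_s),
    ("word_then_number", word_then_number),
]:
    print(name, hits)
```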