Factbites
 Where results make sense
About us   |   Why use us?   |   Reviews   |   PR   |   Contact us  

Topic: Lexicon size


  
  Method of encoding compressed data - United States Patent 5,001,478
A lexicon of this size should be supported by a hash table with at least 2,048 entries to avoid excessive numbers of hash table collisions.
If a match is found in the lexicon, then, as shown in block 19, the match is set to the type lexicon with a corresponding index value and, returning to block 15, another symbol from the input sequence is read and it is appended to the token.
After the lexicon reference, history reference, the literal reference has been emitted, as shown in block 35, the reference string is appended to the history buffer and deleted from the beginning of the token.
xrint.com /patents/us/5001478   (3538 words)

  
 Evaluating Lexicon
A natural initial measure of dictionary quality would be the size of the dictionary; one might hope that bigger would be better, in terms either of the number of headwords or the total number of entries in the lexicon.
Furthermore the exclusion of single characters changes lexicon size far less than the difference between any of the primary dictionaries, a decrease of 5,000 words in contrast to differences of between 30,000 and 100,000, but this change results in highly significant decreases in performance.
In contrast, for lexicon coverage, using either ``by token'' or ``idf-weighted'' metrics, the prediction is that Opti consistently exceeds LDC for all words, stopped or unstopped English and with and without single characters in the Chinese.
www.umiacs.umd.edu /~gina/cv/papers/Beijing_99/evaluating_lexicon.htm   (3072 words)

  
 Old and antique prints and maps:
Size 9.5 x 15.5 cms including title, plus margins.
Size 10.5 x 17 cms including title, plus margins.
Size 14 x 11.5 cms including title, plus margins.
www.antiqueprints.com /products.php?pg=204   (616 words)

  
 Automating the production of bibliographic records for MEDLINE
Candidate lexicons and lookup criteria were evaluated with the goal of removing low confidence values from ground truth words that were correct, while retaining the low confidence values for those words that were not correct.
However, when using the complete lexicon, the rate of false positives was also high because a single OCR error or omission can result in a word that is found in the lexicon.
Combinations of matching techniques and lexicon sizes were tested in an effort to reduce the false positive rate and the processing time while maintaining the high match rates that had been observed.
archive.nlm.nih.gov /pubs/thoma/mars2001_10.php   (2873 words)

  
 Building a Distributed Full-Text Index for the Web
For buffer sizes less than 40, loading proved to be the bottleneck, and both the processing and flushing phases had to wait periodically for the loading phase to complete.
However, as the buffer size increased beyond 40, the processing phase dominated the execution time as larger and larger buffers of postings had to be sorted.
The crucial factors determining index size are the number of internal pages (a function of the height of the B-tree) and the number of overflow pages (which Berkeley DB uses to handle large value fields).
www10.org /cdrom/papers/275   (8107 words)

  
 Lexicon Bookshop specialists in Manx Books - General books
Pages: 64 Size: 15x21 PB This book is a study of Manx milestones and traces back the history of some of them.
Pages: 183 Size: 21x15 PB This murder/mystery novel begins when the Isle of Man is shaken by the discovery of two macabre and apparently senseless murders.
Pages: 107 Size: 30x22 HB A history of Manx lighthouses from 1786 together with details of a keeper's life in the 1850's.
www.lexiconbookshop.co.im /pages/books/general.html   (497 words)

  
 Research Progress   (Site not responding. Last check: 2007-10-30)
Use lexicon based compression to approximate Kolmogorov Complexity and construct a metric formula as proposed by Ming Li.
K(x): for input x, use Viterbi Training method to construct its lexicon and use huffman coding to compress the viterbi path of the input.The size of the compressed input is the approximate Kolmogorove Complexity of input x.
Then use Viterbi Training method to optimize its lexicon and use huffman coding to compress the viterbi path of the input.The size of the compressed input is the approximate Kolmogorove Complexity of input x based on y.
people.cs.uchicago.edu /~jliu/research.html   (710 words)

  
 [No title]
The first introductory action was to perform a feasibility study aiming at the exploration of the basic conditions for the implementation of the initiative and through that to procure the necessary basis for decisions before a start-up of the project.
The size of each corpus is planned to be between half a mill.
The development of a large size computational lexicon is costly and time-consuming work; therefore existing lexical resources will be integrated into the STO database to the greatest possible extent provided that it is technically and legally feasible.
cst.dk /sto/granada/uk   (4383 words)

  
 Sororities
Perhaps most significantly, Shipman and Zue (1982) showed that phonemic patterns are sufficient to isolate a significant number of the words in a 20,000 word lexicon (for example, the pattern: [cons] [cons] [l] [vowel] [nasal] [stop] uniquely identifies “splint” in English).
Shipman and Zue focused only on the relationship between phonemic patterns and lexicon size — no in-depth study has been made of the structural properties of the lexicon independent of word frequency or specifically with reference to sonority.
It uses the Hoosier Mental Lexicon (Nusbaum et al, 1984) as a basis for a statistical analysis of the sonority contours of approximately 20,000 words and their relationship to frequency and familiarity.
www.shaav.com /professional/linguistics/sororities.html   (655 words)

  
 Word recognition with BREF   (Site not responding. Last check: 2007-10-30)
For the 10K lexicon the average number of phone nodes per word is reduced from 6.4 to 2.0 by using such a tree instead of a linear representation of each word, giving a reduction of 69%in the size of the graph.
The use of the word-pair grammar reduces the perplexities to 101 for the 1K lexicon and 160 for the 3K lexicon, and reduces the error rate by almost 60%.
In addition, the drop in performance observed by increasing the lexicon size is smaller than for the no grammar case, as is expected given that the perplexity is not proportional to the size of the lexicon.
www.limsi.fr /Recherche/TLP/reco/rmsep92/section3_4.html   (509 words)

  
 Lexicon MC-12 Downloads
Lexicon is pleased to provide the V1.60 Configuration Utility.
This version replaces all previous versions of the Configuration Utility, and is compatible with all current versions of Lexicon processors.
Lexicon is a division of Harman Specialty Group and is subject
www.lexicon.com /products/download-details.asp?ID=1&FileID=80   (163 words)

  
 [No title]
This lexicon can be automatically generated out of the sentences to be recognised.
Modularity: The program including a small fullform lexicon can be separated from the files containing expressions to be entered in the lexicon of stems.
Size: - lines of source code: 12,000 - kilobytes of executable: 500KBytes - man years of work: 3 years data: 200 word JPSG entry for Japanese.
www.umich.edu /~archive/linguistics/software/nl.software.registry/body.ascii   (9553 words)

  
 Word Identification and Eye Movements in Reading Chinese: Chapter 9
Four smaller lexicons with different sizes were also compiled by using different fragment frequencies as cutoff points to select entries from the 131,616-word master lexicon.
Figure B2 that as the size of lexicon decreases, the distribution of saccade lengths shifts rightward.
Figure B2 is that even if the size of lexicon is reduced to only one quarter of the original size, the saccade length distribution is very similar to that found with the entire lexicon.
research.chtsai.org /dissertation/chapter-9.html   (5542 words)

  
 BEIKS>Palm>Dictionary>Language>Japanese: Japanese dictionaries
Excellent for travelers, but due to the rich content may also be of use to professional translators.
Romanized transcription is used to represent the Japanese terms and due to the use of ancient words this edition may not be as good for travelers as the Gold version.
of the full lexicon size or are limited to only the first few letters of the alphabet.
www.beiks.com /palmzonebg/Lexicons/Japanese.htm   (300 words)

  
 DJGPP Lexicon
To reduce the size and complexity of each language's compiler, the compilation is broken down into different parts, each handled by a different program.
Also, the startup code is a fixed size, meaning that the larger your program is, the less proportional amount of it will be taken up by the startup code.
A program that modifies a number of parameters in the stub, such as the name of the DPMI server, the amount of stack space to allocate, and the size of the transfer buffer.
www.delorie.com /djgpp/doc/lexicon   (5145 words)

  
 Lexicon
Special RAM (sometimes built into the processor) in which frequently accessed pieces of information can be stored to avoid having to search the entire memory bank for them.
Although the cartridges are different sizes, it is "partially" compatible with the Sega Master System.
The disks, which are the same size as CDs, are capable of holding 1 gigabyte of data.
faqs.ign.com /lexicon.html   (8976 words)

  
 Old and antique prints and maps: Antique maps
Size 38.5 x 27 cms plus good margins.
Size 38 x 26 cms plus good margins.
Size 39 x 26 cms plus good margins.
www.antiqueprints.com /products.php?cat=18&pg=42   (778 words)

  
 Old and antique prints and maps: Antique maps
Size 18 x 22 cms plus good margins.
Size 18 x 22.5 cms plus good margins.
Size 25 x 21.5 cms plus good margins.
www.antiqueprints.com /products.php?cat=18&pg=42   (740 words)

  
 Named Entity Extraction from Broadcast News
The reason is that speech lexicons are designed to include the most frequent words, thus ensuring that OOV words will represent only a small fraction of the words in any test set.
To explore this, we measured the percentage of names in the Broadcast News data that contain at least one OOV word as a function of lexicon size.
The percentage of in-vocabulary events of each type as a function of lexicon size is shown in Table 4.
www.nist.gov /speech/publications/darpa99/html/ie20/ie20.htm   (2920 words)

  
 Palm Dictionary. French dictionaries for Palm OS
This is a lexicon translates from English to French.
The Standard lexicon has the basic set of words used in almost most all areas of life.
The most comprehensive lexicon of its size for Palm.
www.beiks.com /palmzonebg/Lexicons/French.htm   (254 words)

  
 *TALŌ's spelling languages
Two lexicons are available, one according to the spelling of the Le Nouveau Petit Robert (2003) and one according to the most recent Rectifications de l’orthographe of the Conseil supérieur de la langue française first published 6 December 1990 (see also
The lexicon agrees with the spelling rules of the Suid-Afrikaanse Taalkommissie, 2002.
The Zulu language is spoken in the Republic of South Africa and is written in the Latin alphabet.
www.talo.nl /talo/spellingcheckers/spellingLanguages.html   (1721 words)

  
 Lexicon Product Listing | Sweetwater.com
Differentiating itself from standard computer I/O boxes which are typically based on a patch-bay paradigm, the Omega 8x4x2 USB I/O mixer is based on a mixer paradigm and includes input, output and mixing functions that support a variety of tracking/...
The remote control is about the size of a laptop and features a joy stick, big buttons, faders and an angled user-interface screen.
The PCM 91 Digital Reverberator offers Lexicon's highest quality reverbs in a compact, affordable package with a powerful interface which allows both easy access and a wealth of programming capabilities for the sound designer...
www.sweetwater.com /store/manufacturer/Lexicon   (828 words)

  
 Speech Synthesis in Festival - 6 Linguistic/Prosodic processing
Multiple lexicons are supported at once as different lexicons may be required even for the same language, e.g.
The phones produced by a lexicon should be suitable for the waveform synthesis method that is to be used (though Festival does supports phoneme mapping if really desired).
BAsically the lexicon optimised models were over trained for that test set, so we relaxed the stop criteria for the CART trees and got a better result on the 1,775 unknown words.
festvox.org /festtut/notes/festtut_6.html   (6254 words)

  
 Card Games: Commercial Games
The core of the game is a 50-card deck corresponding to the states of the USA, each showing a map of the state, its rank (1st to 50th) in size, population and "statehood" (date at which it became a state of the USA), its capital, three largest cities, bordering states, state flower and state bird.
Lexicon was first published in Britain by Waddington in 1933, and an American version was launched by Parker Brothers in 1937.
In the first deal each player is dealt 1 card; the size of hands increases by one in each deal, until the whole pack (or as much of it as possible) is dealt.
www.pagat.com /com   (12437 words)

  
 Solving the Lexicon Acquisition Problem
A first attempt to solve the Lexicon Acquisition Problem might be to examine all interpretation functions across the corpus, then choose the one(s) with minimal lexicon size.
Therefore, finding a lexicon by examining all interpretations across the corpus, then choosing the lexicon(s) of minimum size, is clearly not tractable.
This does not completely rule out fracturing as part of a technique for lexicon learning since trees do not tend to get very large, and indeed Siskind uses it in many of his systems, with other constraints to help control the search.
www.cs.cmu.edu /afs/cs.cmu.edu/project/jair/pub/volume18/thompson03a-html/node9.html   (719 words)

  
 size - OneLook Dictionary Search
SIZE : 1911 edition of the Encyclopedia Britannica [home, info]
Phrases that include size: pint size, full size, size up, pocket size, ask size, more...
Words similar to size: sized, sizing, capacity, dimension, extent, magnitude, mass, size of it, more...
www.onelook.com /?loc=lemma3&w=size   (397 words)

  
 An Gramadóir : History   (Site not responding. Last check: 2007-10-30)
Added a complete morphological analyzer which greatly improves the error messages when words are not found in the lexicon.
Also, since the lowered version of a word like "hAire" is not automatically searched (since the capitalized version is in the lexicon), this gets the correct, unambiguous masculine POS tag.
Or in "bPáirtí Glas", the first word is now recognized unambiguously as a noun which then has the added benefit of allowing "Glas" to be correctly recognized as an adjective.
borel.slu.edu /gramadoir/stair.html   (681 words)

  
 Lexicon Effects on Chinese Information Retrieval - Kwok (ResearchIndex)   (Site not responding. Last check: 2007-10-30)
Abstract: We investigate the effects of lexicon size and stopwords on Chinese information retrieval using our method of short-word segmentation based on simple language usage rules and statistics.
These rules allow us to employ a small lexicon of only 2,175 entries and provide quite admirable retrieval results.
...new words discovered from the collection, the final lexicon size is about 43K.
citeseer.ist.psu.edu /43981.html   (467 words)

  
 Department of Defense Acronyms Dict - Palm Software Application Store   (Site not responding. Last check: 2007-10-30)
This is a FREE Lexicon for BDicty Dictionary Reader.
The lexicon file size is over 88 kB.
BDicty is a general purpose dictionary program that allows hosting of miscellaneous language translation lexicons (distributed separately).
palmsource.palmgear.com /index.cfm?fuseaction=software.showsoftware&prodid=28416   (134 words)

  
 How likely are chance resemblances between languages?
The reason is not hard to find: as the lexicon size increases, the chance of a given match goes down, but we get more chances.
is of course 1 in n (the lexicon size).
Let's assume the lexicon is 2000 words-- hopefully a good estimate of the size of the lexicons available for many of the obscure languages they work with.
zompist.com /chance.htm   (6096 words)

  
 Palm Software:  Travel --> Translation & Language
This is a FREE New Testament Verbs lexicon for BDicty.
TagalogPalm is a revolutionary standard for handheld computing, a new form of computing focused to help people manage and access information at any time, in any location and moreover in their own native language.
This is another free dictionary from the ever growing collection of FREE lexicons for BDicty.
applications.palmsource.com /Software/Solutions.asp?PCID=28&PSCID=126&offset=300   (717 words)

Try your search on: Qwika (all wikis)

Factbites
  About us   |   Why use us?   |   Reviews   |   Press   |   Contact us  
Copyright © 2005-2007 www.factbites.com Usage implies agreement with terms.