Lexicon
Multilink’s lexicon currently contains 1466 pairs of 3-8 letter Dutch words and their most prominent 3-8 letter English translation equivalents. For both Dutch and English words, frequencies of usage (in occurrences per million) and phonological specifications (SAMPA) are included.
The frequencies in Occurrences Per Million (OPM) are largely derived from SUBTLEX-UK and SUBTLEX-NL; in a few cases a CELEX frequency was chosen when other information was missing. Note that all English frequencies have been divided by 4, in order to simulate the performance of unbalanced Dutch (L1) - English (L2) bilinguals. We are currently evaluating this admittedly arbitrary setting for subjective frequency.
For demonstration purposes, the lexicon contains some interlingual homographs (defined here as two word forms from different languages with the same orthography but a different meaning) and cognates (translation equivalents with form overlap). However, the set of currently included homographs is not exhaustive or representative for the languages as a whole.
Here is an example of several lines from the lexicon:
Dutch:O | Dutch:O:freq | Dutch:P | Dutch:P:freq | English:O | English:O:freq | English:P | English:P:freq |
---|---|---|---|---|---|---|---|
AANBOD | 26.85 | ambOt | 26.85 | OFFER | 18.6775 | Qf@R | 18.6775 |
AANDACHT | 56.69 | andAxt | 56.69 | ATTENTION | 24.6675 | @tEnSH | 24.6675 |
AANDEEL | 9.95 | andel | 9.95 | SHARE | 17.3775 | S8R | 17.3775 |
AANLEG | 2.88 | anlEx | 2.88 | INSTANCE | 4.195 | Inst@ns | 4.195 |
AAP | 28.56 | ap | 28.56 | MONKEY | 8.3775 | mVNkI | 8.3775 |
AARD | 15.3215 | art | 15.3215 | NATURE | 11.29 | n1J@R | 11.29 |
AARDAPPEL | 3.3387 | ardAp@l | 3.3387 | POTATO | 2.8225 | p@t1t5 | 2.8225 |
AARDBEI | 1.56 | ardbK | 1.56 | STRAWBERRY | 1.3825 | str$b@rI | 1.3825 |
AARDE | 100.07 | ard@ | 100.07 | EARTH | 24.8725 | 3T | 24.8725 |
AARDIG | 191.95 | ard@x | 191.95 | FRIENDLY | 6.51 | frEndlI | 6.51 |