1 | | From svn.tartarus.org/snowball/trunk/website/algorithms/dutch/stop.txt |
---|
2 | | This file is distributed under the BSD License. |
---|
3 | | See http://snowball.tartarus.org/license.php |
---|
4 | | Also see http://www.opensource.org/licenses/bsd-license.html |
---|
5 | | - Encoding was converted to UTF-8. |
---|
6 | | - This notice was added. |
---|
7 | |
---|
8 | | A Dutch stop word list. Comments begin with vertical bar. Each stop |
---|
9 | | word is at the start of a line. |
---|
10 | |
---|
11 | | This is a ranked list (commonest to rarest) of stopwords derived from |
---|
12 | | a large sample of Dutch text. |
---|
13 | |
---|
14 | | Dutch stop words frequently exhibit homonym clashes. These are indicated |
---|
15 | | clearly below. |
---|
16 | |
---|
17 | de | the |
---|
18 | en | and |
---|
19 | van | of, from |
---|
20 | ik | I, the ego |
---|
21 | te | (1) chez, at etc, (2) to, (3) too |
---|
22 | dat | that, which |
---|
23 | die | that, those, who, which |
---|
24 | in | in, inside |
---|
25 | een | a, an, one |
---|
26 | hij | he |
---|
27 | het | the, it |
---|
28 | niet | not, nothing, naught |
---|
29 | zijn | (1) to be, being, (2) his, one's, its |
---|
30 | is | is |
---|
31 | was | (1) was, past tense of all persons sing. of 'zijn' (to be) (2) wax, (3) the washing, (4) rise of river |
---|
32 | op | on, upon, at, in, up, used up |
---|
33 | aan | on, upon, to (as dative) |
---|
34 | met | with, by |
---|
35 | als | like, such as, when |
---|
36 | voor | (1) before, in front of, (2) furrow |
---|
37 | had | had, past tense all persons sing. of 'hebben' (have) |
---|
38 | er | there |
---|
39 | maar | but, only |
---|
40 | om | round, about, for etc |
---|
41 | hem | him |
---|
42 | dan | then |
---|
43 | zou | should/would, past tense all persons sing. of 'zullen' |
---|
44 | of | or, whether, if |
---|
45 | wat | what, something, anything |
---|
46 | mijn | possessive and noun 'mine' |
---|
47 | men | people, 'one' |
---|
48 | dit | this |
---|
49 | zo | so, thus, in this way |
---|
50 | door | through by |
---|
51 | over | over, across |
---|
52 | ze | she, her, they, them |
---|
53 | zich | oneself |
---|
54 | bij | (1) a bee, (2) by, near, at |
---|
55 | ook | also, too |
---|
56 | tot | till, until |
---|
57 | je | you |
---|
58 | mij | me |
---|
59 | uit | out of, from |
---|
60 | der | Old Dutch form of 'van der' still found in surnames |
---|
61 | daar | (1) there, (2) because |
---|
62 | haar | (1) her, their, them, (2) hair |
---|
63 | naar | (1) unpleasant, unwell etc, (2) towards, (3) as |
---|
64 | heb | present first person sing. of 'to have' |
---|
65 | hoe | how, why |
---|
66 | heeft | present third person sing. of 'to have' |
---|
67 | hebben | 'to have' and various parts thereof |
---|
68 | deze | this |
---|
69 | u | you |
---|
70 | want | (1) for, (2) mitten, (3) rigging |
---|
71 | nog | yet, still |
---|
72 | zal | 'shall', first and third person sing. of verb 'zullen' (will) |
---|
73 | me | me |
---|
74 | zij | she, they |
---|
75 | nu | now |
---|
76 | ge | 'thou', still used in Belgium and south Netherlands |
---|
77 | geen | none |
---|
78 | omdat | because |
---|
79 | iets | something, somewhat |
---|
80 | worden | to become, grow, get |
---|
81 | toch | yet, still |
---|
82 | al | all, every, each |
---|
83 | waren | (1) 'were' (2) to wander, (3) wares, (3) |
---|
84 | veel | much, many |
---|
85 | meer | (1) more, (2) lake |
---|
86 | doen | to do, to make |
---|
87 | toen | then, when |
---|
88 | moet | noun 'spot/mote' and present form of 'to must' |
---|
89 | ben | (1) am, (2) 'are' in interrogative second person singular of 'to be' |
---|
90 | zonder | without |
---|
91 | kan | noun 'can' and present form of 'to be able' |
---|
92 | hun | their, them |
---|
93 | dus | so, consequently |
---|
94 | alles | all, everything, anything |
---|
95 | onder | under, beneath |
---|
96 | ja | yes, of course |
---|
97 | eens | once, one day |
---|
98 | hier | here |
---|
99 | wie | who |
---|
100 | werd | imperfect third person sing. of 'become' |
---|
101 | altijd | always |
---|
102 | doch | yet, but etc |
---|
103 | wordt | present third person sing. of 'become' |
---|
104 | wezen | (1) to be, (2) 'been' as in 'been fishing', (3) orphans |
---|
105 | kunnen | to be able |
---|
106 | ons | us/our |
---|
107 | zelf | self |
---|
108 | tegen | against, towards, at |
---|
109 | na | after, near |
---|
110 | reeds | already |
---|
111 | wil | (1) present tense of 'want', (2) 'will', noun, (3) fender |
---|
112 | kon | could; past tense of 'to be able' |
---|
113 | niets | nothing |
---|
114 | uw | your |
---|
115 | iemand | somebody |
---|
116 | geweest | been; past participle of 'be' |
---|
117 | andere | other |
---|