Bibliša: Aligned Collection Search Tool
Bag of words
ID: 1.2008.1/2.4
Title: A Suffix Subsumption-based Approach to Building Stemmers and Lemmatizers for Highly Inflectional Languages with Sparse Resources
Authors: Vlado Kešelj, Danko Šipka
Part of speech: All, Nouns, Verbs, Adjectives, Adverbs
No.  Lemma     Frequency
1    jesam     258
2    u         148
3    dati      91
4    sufiks    83
5    biti      68
6    su        62
7    pravilo   50
8    praviti   46
9    klasa     41
10   a         40
11   resurs    38
12   jezik     33
13   viti      32
14   lem       31
15   oblik     31
16   ala       27
17   po        27
18   sam       25
19   imati     24
20   kao       24
21   pristup   23
22   mocxi     23
23   koje      22
24   s         22
25   ini       22
Tag cloud: viti, u, sufiks, su, sam, s, resurs, pristup, praviti, pravilo, po, oblik, mocxi, lem, koje, klasa, kao, jezik, jesam, ini, imati, dati, biti, ala, a