mirror of
https://github.com/KevinMidboe/linguist.git
synced 2025-10-29 09:40:21 +00:00
* * add CoNLL-U format - add to languages.yml - add textmate grammar - add to vendor/README - add to grammars.yml - add samples * rm other extensions as I couldn't find properly licensed examples of them in the wild * substitutesamples for something with appropriate license * update grammar submodule so it finds the LICENSE * add license to grammar * * conllu - readd other extensions - abridge samples and a new one - update grammar submodule: correct extension of grammar file * rm .conllx extension
123 lines
6.7 KiB
Plaintext
123 lines
6.7 KiB
Plaintext
# newdoc id = weblog-blogspot.com_zentelligence_20040423000200_ENG_20040423_000200
|
|
# sent_id = weblog-blogspot.com_zentelligence_20040423000200_ENG_20040423_000200-0001
|
|
# text = What if Google Morphed Into GoogleOS?
|
|
1 What what PRON WP PronType=Int 0 root 0:root _
|
|
2 if if SCONJ IN _ 4 mark 4:mark _
|
|
3 Google Google PROPN NNP Number=Sing 4 nsubj 4:nsubj _
|
|
4 Morphed morph VERB VBD Mood=Ind|Tense=Past|VerbForm=Fin 1 advcl 1:advcl _
|
|
5 Into into ADP IN _ 6 case 6:case _
|
|
6 GoogleOS GoogleOS PROPN NNP Number=Sing 4 obl 4:obl SpaceAfter=No
|
|
7 ? ? PUNCT . _ 4 punct 4:punct _
|
|
|
|
# sent_id = weblog-blogspot.com_zentelligence_20040423000200_ENG_20040423_000200-0002
|
|
# text = What if Google expanded on its search-engine (and now e-mail) wares into a full-fledged operating system?
|
|
1 What what PRON WP PronType=Int 0 root 0:root _
|
|
2 if if SCONJ IN _ 4 mark 4:mark _
|
|
3 Google Google PROPN NNP Number=Sing 4 nsubj 4:nsubj _
|
|
4 expanded expand VERB VBD Mood=Ind|Tense=Past|VerbForm=Fin 1 advcl 1:advcl _
|
|
5 on on ADP IN _ 15 case 15:case _
|
|
6 its its PRON PRP$ Gender=Neut|Number=Sing|Person=3|Poss=Yes|PronType=Prs 15 nmod:poss 15:nmod:poss _
|
|
7 search search NOUN NN Number=Sing 9 compound 9:compound SpaceAfter=No
|
|
8 - - PUNCT HYPH _ 9 punct 9:punct SpaceAfter=No
|
|
9 engine engine NOUN NN Number=Sing 15 compound 15:compound _
|
|
10 ( ( PUNCT -LRB- _ 9 punct 9:punct SpaceAfter=No
|
|
11 and and CCONJ CC _ 13 cc 13:cc _
|
|
12 now now ADV RB _ 13 advmod 13:advmod _
|
|
13 e-mail e-mail NOUN NN Number=Sing 9 conj 9:conj SpaceAfter=No
|
|
14 ) ) PUNCT -RRB- _ 15 punct 15:punct _
|
|
15 wares wares NOUN NNS Number=Plur 4 obl 4:obl _
|
|
16 into into ADP IN _ 22 case 22:case _
|
|
17 a a DET DT Definite=Ind|PronType=Art 22 det 22:det _
|
|
18 full full ADV RB _ 20 advmod 20:advmod SpaceAfter=No
|
|
19 - - PUNCT HYPH _ 20 punct 20:punct SpaceAfter=No
|
|
20 fledged fledged ADJ JJ Degree=Pos 22 amod 22:amod _
|
|
21 operating operating NOUN NN Number=Sing 22 compound 22:compound _
|
|
22 system system NOUN NN Number=Sing 4 obl 4:obl SpaceAfter=No
|
|
23 ? ? PUNCT . _ 4 punct 4:punct _
|
|
|
|
# sent_id = weblog-blogspot.com_zentelligence_20040423000200_ENG_20040423_000200-0003
|
|
# text = [via Microsoft Watch from Mary Jo Foley ]
|
|
1 [ [ PUNCT -LRB- _ 4 punct 4:punct SpaceAfter=No
|
|
2 via via ADP IN _ 4 case 4:case _
|
|
3 Microsoft Microsoft PROPN NNP Number=Sing 4 compound 4:compound _
|
|
4 Watch Watch PROPN NNP Number=Sing 0 root 0:root _
|
|
5 from from ADP IN _ 6 case 6:case _
|
|
6 Mary Mary PROPN NNP Number=Sing 4 nmod 4:nmod _
|
|
7 Jo Jo PROPN NNP Number=Sing 6 flat 6:flat _
|
|
8 Foley Foley PROPN NNP Number=Sing 6 flat 6:flat _
|
|
9 ] ] PUNCT -RRB- _ 4 punct 4:punct _
|
|
|
|
# newdoc id = weblog-blogspot.com_marketview_20050511222700_ENG_20050511_222700
|
|
# sent_id = weblog-blogspot.com_marketview_20050511222700_ENG_20050511_222700-0001
|
|
# text = (And, by the way, is anybody else just a little nostalgic for the days when that was a good thing?)
|
|
1 ( ( PUNCT -LRB- _ 14 punct 14:punct SpaceAfter=No
|
|
2 And and CCONJ CC _ 14 cc 14:cc SpaceAfter=No
|
|
3 , , PUNCT , _ 14 punct 14:punct _
|
|
4 by by ADP IN _ 6 case 6:case _
|
|
5 the the DET DT Definite=Def|PronType=Art 6 det 6:det _
|
|
6 way way NOUN NN Number=Sing 14 obl 14:obl SpaceAfter=No
|
|
7 , , PUNCT , _ 14 punct 14:punct _
|
|
8 is be AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin 14 cop 14:cop _
|
|
9 anybody anybody PRON NN Number=Sing 14 nsubj 14:nsubj _
|
|
10 else else ADJ JJ Degree=Pos 9 amod 9:amod _
|
|
11 just just ADV RB _ 13 advmod 13:advmod _
|
|
12 a a DET DT Definite=Ind|PronType=Art 13 det 13:det _
|
|
13 little little ADJ JJ Degree=Pos 14 obl:npmod 14:obl:npmod _
|
|
14 nostalgic nostalgic NOUN NN Number=Sing 0 root 0:root _
|
|
15 for for ADP IN _ 17 case 17:case _
|
|
16 the the DET DT Definite=Def|PronType=Art 17 det 17:det _
|
|
17 days day NOUN NNS Number=Plur 14 nmod 14:nmod _
|
|
18 when when ADV WRB PronType=Rel 23 advmod 23:advmod _
|
|
19 that that PRON DT Number=Sing|PronType=Dem 23 nsubj 23:nsubj _
|
|
20 was be AUX VBD Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin 23 cop 23:cop _
|
|
21 a a DET DT Definite=Ind|PronType=Art 23 det 23:det _
|
|
22 good good ADJ JJ Degree=Pos 23 amod 23:amod _
|
|
23 thing thing NOUN NN Number=Sing 17 acl:relcl 17:acl:relcl SpaceAfter=No
|
|
24 ? ? PUNCT . _ 14 punct 14:punct SpaceAfter=No
|
|
25 ) ) PUNCT -RRB- _ 14 punct 14:punct _
|
|
|
|
# sent_id = weblog-blogspot.com_marketview_20050511222700_ENG_20050511_222700-0002
|
|
# text = This BuzzMachine post argues that Google's rush toward ubiquity might backfire -- which we've all heard before, but it's particularly well-put in this post.
|
|
1 This this DET DT Number=Sing|PronType=Dem 3 det 3:det _
|
|
2 BuzzMachine BuzzMachine PROPN NNP Number=Sing 3 compound 3:compound _
|
|
3 post post NOUN NN Number=Sing 4 nsubj 4:nsubj _
|
|
4 argues argue VERB VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin 0 root 0:root _
|
|
5 that that SCONJ IN _ 12 mark 12:mark _
|
|
6 Google Google PROPN NNP Number=Sing 8 nmod:poss 8:nmod:poss SpaceAfter=No
|
|
7 's 's PART POS _ 6 case 6:case _
|
|
8 rush rush NOUN NN Number=Sing 12 nsubj 12:nsubj _
|
|
9 toward toward ADP IN _ 10 case 10:case _
|
|
10 ubiquity ubiquity NOUN NN Number=Sing 8 nmod 8:nmod _
|
|
11 might might AUX MD VerbForm=Fin 12 aux 12:aux _
|
|
12 backfire backfire VERB VB VerbForm=Inf 4 ccomp 4:ccomp _
|
|
13 -- -- PUNCT , _ 12 punct 12:punct _
|
|
14 which which PRON WDT PronType=Rel 18 obj 18:obj _
|
|
15 we we PRON PRP Case=Nom|Number=Plur|Person=1|PronType=Prs 18 nsubj 18:nsubj SpaceAfter=No
|
|
16 've have AUX VBP Mood=Ind|Tense=Pres|VerbForm=Fin 18 aux 18:aux _
|
|
17 all all ADV RB _ 18 advmod 18:advmod _
|
|
18 heard hear VERB VBN Tense=Past|VerbForm=Part 12 acl:relcl 12:acl:relcl _
|
|
19 before before ADV RB _ 18 advmod 18:advmod SpaceAfter=No
|
|
20 , , PUNCT , _ 27 punct 27:punct _
|
|
21 but but CCONJ CC _ 27 cc 27:cc _
|
|
22 it it PRON PRP Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs 27 nsubj:pass 27:nsubj:pass SpaceAfter=No
|
|
23 's be VERB VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin 27 aux:pass 27:aux:pass _
|
|
24 particularly particularly ADV RB _ 27 advmod 27:advmod _
|
|
25 well well ADV RB Degree=Pos 27 advmod 27:advmod SpaceAfter=No
|
|
26 - - PUNCT HYPH _ 27 punct 27:punct SpaceAfter=No
|
|
27 put put VERB VBN Tense=Past|VerbForm=Part 4 conj 4:conj _
|
|
28 in in ADP IN _ 30 case 30:case _
|
|
29 this this DET DT Number=Sing|PronType=Dem 30 det 30:det _
|
|
30 post post NOUN NN Number=Sing 27 obl 27:obl SpaceAfter=No
|
|
31 . . PUNCT . _ 4 punct 4:punct _
|
|
|
|
# sent_id = weblog-blogspot.com_marketview_20050511222700_ENG_20050511_222700-0003
|
|
# text = Google is a nice search engine.
|
|
1 Google Google PROPN NNP Number=Sing 6 nsubj 6:nsubj _
|
|
2 is be AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin 6 cop 6:cop _
|
|
3 a a DET DT Definite=Ind|PronType=Art 6 det 6:det _
|
|
4 nice nice ADJ JJ Degree=Pos 6 amod 6:amod _
|
|
5 search search NOUN NN Number=Sing 6 compound 6:compound _
|
|
6 engine engine NOUN NN Number=Sing 0 root 0:root SpaceAfter=No
|
|
7 . . PUNCT . _ 6 punct 6:punct _
|
|
|