mirror of
https://github.com/KevinMidboe/linguist.git
synced 2025-12-29 13:21:01 +00:00
Minor README
This commit is contained in:
@@ -6,12 +6,9 @@ We use this library at GitHub to detect blob languages, highlight code, ignore b
|
|||||||
|
|
||||||
### Language detection
|
### Language detection
|
||||||
|
|
||||||
Linguist defines a list of all languages known to GitHub in a [yaml file](https://github.com/github/linguist/blob/master/lib/linguist/languages.yml). In order for a file to be highlighted, a language and lexer must be defined there.
|
Linguist defines a list of all languages known to GitHub in a [yaml file](https://github.com/github/linguist/blob/master/lib/linguist/languages.yml). In order for a file to be highlighted, a language and a lexer must be defined there.
|
||||||
|
|
||||||
Most languages are detected by their file extension. This is the fastest and most common situation.
|
Most languages are detected by their file extension. For disambiguating between files with common extensions, we first apply some common-sense heuristics to pick out obvious languages. After that, we use a
|
||||||
|
|
||||||
For disambiguating between files with common extensions, we first apply
|
|
||||||
some common-sense heuristics to pick out obvious languages. After that, we use a
|
|
||||||
[statistical
|
[statistical
|
||||||
classifier](https://github.com/github/linguist/blob/master/lib/linguist/classifier.rb).
|
classifier](https://github.com/github/linguist/blob/master/lib/linguist/classifier.rb).
|
||||||
This process can help us tell the difference between, for example, `.h` files which could be either C, C++, or Obj-C.
|
This process can help us tell the difference between, for example, `.h` files which could be either C, C++, or Obj-C.
|
||||||
|
|||||||
Reference in New Issue
Block a user