Some languages are sensitive to file names in the sense that
different kinds of files contain somewhat different data.
For example, GAP .tst files contain test cases, which add some
extra data compared to regular code and as a consequence are
not directly interchangeable with regular source code.
Heuristics may need to take this into account, and so may need
to know the name of the file being analyzed.
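
As a rough illustration of why the name matters, here is a minimal
sketch of a filename-aware check. It is not Linguist's actual API: the
method name `classify_gap_file` and the returned symbols are
hypothetical, and the sketch assumes that GAP test files record
sessions whose input lines start with a `gap> ` prompt.

```ruby
# Hypothetical sketch; not part of Linguist. Names and return values
# are invented for illustration.
def classify_gap_file(filename, data)
  if File.extname(filename).downcase == ".tst"
    # Assumption: GAP test files contain "gap> " prompt lines, which
    # regular GAP source files do not.
    data.match?(/^gap> /) ? :gap_test : :unknown
  else
    # Without the .tst extension, treat the content as regular source.
    :gap_source
  end
end
```

The content check alone cannot separate the two kinds of file reliably,
which is why the sketch looks at the extension first.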
* origin/master:
Add Gemfile.lock sample
Remove deprecated method
#all_extensions already includes primary extension
typo
remove unused assertion
Symlink ant.xml to build.xml
Avoid shadowing variable name
Update comment
Make missing sample failure message similar
Remove blank extensions property
Fix sample tests
Add Forth extensions .f and .for; add heuristics for Forth and FORTRAN.
Add FORTRAN and Forth samples.
Extensions aren't actually required
Fix errors from pedantic test
Make pedantic test actually pedantic
Removing extensions when they should be filenames
Adding sample pom.xml files
Link to contributing docs
require samples if filename matches multiple languages
Conflicts:
test/test_pedantic.rb
We require samples for explicitly defined filenames that match multiple languages. This is generally a good thing, but in this case the samples will be identical.
* origin/master:
Allow mime-types 2.x to be used with Linguist
Upgrade to rugged 0.22.0b1
Mention that languages need to be quite popular
fix vendor/cache
Gemfile.lock is no longer considered generated
Tests for BlobHelper#empty?
remove reference to empty.js
Remove more empty samples
Bail earlier if the file is empty.
Moving comments
Use heuristics earlier to inform the rest of the classification process
Removing inconsistency of `find_by_heuristics` (was sometimes returning nil and sometimes returning an empty array)
Removing unused array of candidate languages.
Reworking most heuristics to only return one match