ValidateContentFromTextFile
- Descriptions:
Validate the content of lexical records from a text file. It validates:
- Check Reg (glreg & regd):
- Check inflection rules on glreg for nouns
- Check inflection rules on regd for verbs and adjs
- Check/fix duplicated terms:
- Check duplicated base & spelling variants
- Check duplicated acronyms
- Check duplicated abbreviations
- Check duplicated nominalizations in noun, verb, adj
- Check duplicated variant (irreg) in noun, verb, adj, adv, etc.
- Requirements:
- lexCheck${YEAR}dist.jar
or
- lexCheck${YEAR}api.jar and lvg${YEAR}api.jar
- Usage:
shell>ValidateContentFromTextFile <inFile> <outFile> <-v> <preposition.data> <irregExpFile>
- inFile: lexical record in text format
- outFile: auto-fixed lexical record in text format
- -v: set verbose to true, default: fault
- prepositionFile: the preposition file
default: use the prepositions.data included in the lexCheck${YEAR}api.jar or lexCheck${YEAR}dist.jar
=> prepositions are used in the class of Compl.CheckPreposition.java
- irregExpFile: the irregExceptions.data file, if the specified file does not exist, use the default irregExceptions.data in lexCheck${YEAR}api.jar or lexCheck${YEAR}dist.jar
- Outputs:
- Confirmed message if records are valid. Otherwise, error message
- Auto-fixed Lexicon (remove duplicates)
- Examples:
shell> ValidateContentFromTextFile ./in.txt
shell> ValidateContentFromTextFile ./in.txt ./out.txt
shell> ValidateContentFromTextFile ./in.txt ./out.txt -v
shell> ValidateContentFromTextFile ./in.txt ./out.txt -v ./data/Files/prepositions.data ./data/Files/irregExceptions.data