ValidateContentFromTextFile

  • Descriptions:
    Validates the content of lexical records read from a text file. The following checks are performed:
    • Check Reg (glreg & regd):
      • Check inflection rules on glreg for nouns
      • Check inflection rules on regd for verbs and adjs
    • Check/fix duplicated terms:
      • Check duplicated base & spelling variants
      • Check duplicated acronyms
      • Check duplicated abbreviations
      • Check duplicated nominalizations in noun, verb, adj
      • Check duplicated variant (irreg) in noun, verb, adj, adv, etc.
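
    The duplicate check/fix step above can be illustrated with a minimal Java sketch. This is not the lexCheck implementation: the class and method names are hypothetical, and it assumes two terms are duplicates only when their strings match exactly (the real tool may compare differently).

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only; not part of the lexCheck API.
class DuplicateTermSketch {
    // Returns each term that occurs more than once, in order of first repeat.
    static List<String> findDuplicates(List<String> terms) {
        Set<String> seen = new LinkedHashSet<>();
        Set<String> duplicates = new LinkedHashSet<>();
        for (String term : terms) {
            if (!seen.add(term)) {      // add() returns false if already present
                duplicates.add(term);
            }
        }
        return new ArrayList<>(duplicates);
    }

    // "Auto-fix": keeps the first occurrence of each term, drops later repeats.
    static List<String> removeDuplicates(List<String> terms) {
        return new ArrayList<>(new LinkedHashSet<>(terms));
    }

    public static void main(String[] args) {
        List<String> variants = List.of("colour", "color", "colour");
        System.out.println("duplicates: " + findDuplicates(variants));
        System.out.println("fixed:      " + removeDuplicates(variants));
    }
}
```

    A LinkedHashSet is used so that the original order of the terms is preserved after duplicates are removed.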

  • Requirements:
    • lexCheck${YEAR}dist.jar or
    • lexCheck${YEAR}api.jar and lvg${YEAR}api.jar

  • Usage:
    shell> ValidateContentFromTextFile <inFile> <outFile> <-v> <prepositionFile> <irregExpFile>
    • inFile: lexical record in text format
    • outFile: auto-fixed lexical record in text format
    • -v: set verbose to true, default: false
    • prepositionFile: the preposition file
      default: use the prepositions.data included in the lexCheck${YEAR}api.jar or lexCheck${YEAR}dist.jar
      => prepositions are used in the class of Compl.CheckPreposition.java
    • irregExpFile: the irregExceptions.data file
      default: if the specified file does not exist, use the irregExceptions.data included in the lexCheck${YEAR}api.jar or lexCheck${YEAR}dist.jar
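
    The fallback rule for the data files (use the specified file if it exists, otherwise the default bundled in the jar) might look like the following Java sketch. The class, the method names, and the "file:"/"default:" labels are assumptions made for illustration; they are not taken from the lexCheck source.

```java
import java.io.File;

// Hypothetical illustration only; not part of the lexCheck API.
class DataFileFallbackSketch {
    // True when no usable file was given and the bundled default should be used.
    static boolean useDefault(String path) {
        return path == null || path.isEmpty() || !new File(path).isFile();
    }

    // Describes which source would be read: the user's file or the bundled default.
    static String describeSource(String path, String defaultName) {
        return useDefault(path) ? "default:" + defaultName : "file:" + path;
    }

    public static void main(String[] args) {
        // The optional first argument stands in for irregExpFile.
        String irregExpFile = args.length > 0 ? args[0] : null;
        System.out.println(describeSource(irregExpFile, "irregExceptions.data"));
    }
}
```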

  • Outputs:
    • Confirmation message if all records are valid; otherwise, error messages
    • Auto-fixed Lexicon (remove duplicates)

  • Examples:
    • shell> ValidateContentFromTextFile ./in.txt
    • shell> ValidateContentFromTextFile ./in.txt ./out.txt
    • shell> ValidateContentFromTextFile ./in.txt ./out.txt -v
    • shell> ValidateContentFromTextFile ./in.txt ./out.txt -v ./data/Files/prepositions.data ./data/Files/irregExceptions.data