README UniGene Tabulator January 3, 2013 Quality Control In the UniGene Tabulator table 'SEQUENCE', and the end of UniGene data processing, all records should have a value in the fields NACC (GenBank accession number) and NUID (GenBank Identifier). At the end of the data processing, you may search for records with empty 'NACC' or 'NUID' field [to do this, go to the 'SEQUENCE' table of UniGene Tabulator, choose "Find" on the window bottom bar and then type "=" (without quotes) in the 'NACC' or 'NUID' field]. If you find one or some records without a NACC or a NUID, you may manually fill the missing values in, after obtaining them by searching for the NACC with an empty NUID (or for the NUID with an empty NACC, respectively) at the address: http://www.ncbi.nlm.nih.gov/nuccore This problem may occurr with very large number of sequence records, and it sometimes a FileMaker Pro artifact related to data display rather then to the actual absence of data. The exported file "UniGene Tab" could need manual fixing of these data accordingly. Please also compare the final numebr of records in the tables 'UniGene' and 'SEQUENCE' with the corresponding values available at: http://www.ncbi.nlm.nih.gov/unigene/statistics/