Monarch geneset OGS2.0

DPOGS209922
TranscriptDPOGS209922-TA1719 bp
ProteinDPOGS209922-PA572 aa
Genomic positionDPSCF300180 - 203069-211908
RNAseq coverage405x (Rank: top 30%)
Annotation
HeliconiusHMEL0172984e-17560.71% 
BombyxBGIBMGA010925-TA0.071.07% 
DrosophilaCG16791-PA5e-11236.59% 
EBI UniRef50UniRef50_D6X2V34e-14065.38%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6X2V3_TRICA
NCBI RefSeqXP_967292.21e-13864.15%PREDICTED: similar to AGAP005275-PA [Tribolium castaneum]
NCBI nr blastpgi|2700142001e-13965.38%hypothetical protein TcasGA2_TC016285 [Tribolium castaneum]
NCBI nr blastxgi|2700142003e-14065.38%hypothetical protein TcasGA2_TC016285 [Tribolium castaneum]
Group
KEGG pathwaynvi:1001143261e-117 
 K12386 (CTNS)maps-> Lysosome
Orthology groupMCL16551 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209922-TA
ATGATAATTGAGTCGTGTAATAGATATGTTGTTTTCTTAGGTTGTTCGTTCCCGGTTCGCTGGCAGGGTCGCTGGTTCCAGTCCGGGGTGATCCAGCCCATCATGATAGACGGAGCTGTTCTCTCTAACAAGGGGAGGTGCCTCTCATCCGAGGGTGACAAGTTCCTTATTGTCGACGAGAAAGGCTGTTACCGCTGTGTCGTGATGCACGAAAAACATATTAATGTTCTACAATATAAAGAAACTTTCTGTCACCGTCGCGATGCCTTACCCCACCTCTGTTCTTCTATCACCGGCGATGCGTTGCTATACTCTATGTTCCGGGAGAGCGCCGAGCCCGTCGACTGCCCTCTCAAAGGACCCTTCTCATTCACTTATAACAGGGGCCACGGGGATTGCAAGATTCCGGCCTCATCCATCGAGAGCTGCACCGAAGATTCGAGACTGCTGCTCAACTACCAGGCGTGCCCTGACGTTTACGGATCAGAAAGTACAGTGGAGGAATTAGAATGCTTGGCGACGTGGAAGGAAGGTAGTTTGAGGTTCCTGGTGGGCAAGCTGCATCACAACCACGCCACCAGCAACGAGGACAGATACCGCTGCTTCGTGTACGAGAAGACAAATGGTATTGCATCAGGTAGTAATATGAAGGAGCCAGCTCCTGGCGGAGTGGAGTATAGAGTGGCGCAATCCGGGGACGCCACGTGCAACGGACTGTTCAGTGCCACTGAGGGCTCTCGGACCATGGCTTTGAAACGAGTTTCAGTTCGCTTCAACTGTCAGTTCCCTTCGTGGATGACCTTCTCTCACACGTGGCACACGCTGGACTTCAGCAGTAACTACACCTTCTACCAGCGTAACGCGACCCTCCGCATCACCAACCAGACCGGCTCCGAGATCAAGGTGTACTGCGTCAGCATCAAGGCCAGCTCCCCCAGCGGCAACTCGGTCGCCCTGGTCGCGCACTGGCAACACCACTGCGTGTCTCGCTTCGTGTGCGTGGTGCTGTATCGCCGCGACACCTTCATAGCGGAGCTGCAGCGAGGGTCTCCGGCCGCGCGGCCCGACGACGCCTGCTCCACGCATCACTTCAACGCCGTCACAGCGCCATACGTCACGCTCGTTGCTAGCAATCCTGAATCTAAAGAGTGTCCAGACTCAGGGAAATACGTGATATCGAACAGACGTCACAAGAGGAGTGACGGCGCGAGGAGCGCGGCGGTGGAGGGGAGGAGGAGAAATAACACTAGGACGTTCAGCTTTAATATAAGGAACATGTCCGACACGCCCACGCTGAGGAGCCGGAGACACACAGAGGCCGCGAACTGCGCGGGCGGCTACAACAGACTGGAGATCGGCTGCACCTCCACCAACAACATGGAGTTCTACTCCAGCTGCGACAACAGAGACCTCGTCACAGCGTACACGTGCCACGGCGGCTGGTATGAGGGTGGTTCGTCGTTCGTGGTGACGACCCCCGTGACCCGGGACAGCACCGCCGCCCGCCGGTACTGCTTCGTGTCCCGGGACAACCGCGGCAGTCTCTCGCTCACCCGCTCCCAGGATAACTGCGAGCGCGGGGAGAGAACAGCCGTCGTGTTTGACGCTGTGTTCACCGGTAAATGTCAAGACGAGCCCAACCACCAGCCGCCGTCGAGACCTCCGTCACTCTTCGTCGCCCTCCTGATGCTGGCCGCACAGCACGCGGCGCGGAGGTGA

Protein sequence:

>DPOGS209922-PA
MIIESCNRYVVFLGCSFPVRWQGRWFQSGVIQPIMIDGAVLSNKGRCLSSEGDKFLIVDEKGCYRCVVMHEKHINVLQYKETFCHRRDALPHLCSSITGDALLYSMFRESAEPVDCPLKGPFSFTYNRGHGDCKIPASSIESCTEDSRLLLNYQACPDVYGSESTVEELECLATWKEGSLRFLVGKLHHNHATSNEDRYRCFVYEKTNGIASGSNMKEPAPGGVEYRVAQSGDATCNGLFSATEGSRTMALKRVSVRFNCQFPSWMTFSHTWHTLDFSSNYTFYQRNATLRITNQTGSEIKVYCVSIKASSPSGNSVALVAHWQHHCVSRFVCVVLYRRDTFIAELQRGSPAARPDDACSTHHFNAVTAPYVTLVASNPESKECPDSGKYVISNRRHKRSDGARSAAVEGRRRNNTRTFSFNIRNMSDTPTLRSRRHTEAANCAGGYNRLEIGCTSTNNMEFYSSCDNRDLVTAYTCHGGWYEGGSSFVVTTPVTRDSTAARRYCFVSRDNRGSLSLTRSQDNCERGERTAVVFDAVFTGKCQDEPNHQPPSRPPSLFVALLMLAAQHAARR-