Monarch geneset OGS2.0

DPOGS200883
Transcript: DPOGS200883-TA (1959 bp)
Protein: DPOGS200883-PA (652 aa)
Genomic position: DPSCF300448 + 12246-26302
RNAseq coverage: 204x (Rank: top 47%)
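The lengths and coordinates above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the transcript is coding sequence only and includes the stop codon, and that the genomic coordinates are 1-based and inclusive; neither convention is stated in this record. The helper names `expected_cds_length` and `locus_span` are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1959
PROTEIN_AA = 652
LOCUS_START, LOCUS_END = 12246, 26302


def expected_cds_length(n_residues: int, has_stop: bool = True) -> int:
    """Length in bp of a coding sequence for n_residues amino acids.

    Assumption: the transcript holds only the CDS, optionally with
    one terminal stop codon.
    """
    return 3 * (n_residues + (1 if has_stop else 0))


def locus_span(start: int, end: int) -> int:
    """Span in bp of a coordinate range.

    Assumption: coordinates are 1-based and inclusive.
    """
    return end - start + 1


if __name__ == "__main__":
    # 652 residues plus one stop codon is 653 codons, i.e. 1959 bp.
    print(expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP)
    # Genomic span covered by the gene model on the scaffold.
    print(locus_span(LOCUS_START, LOCUS_END))
```

Under these assumptions, the genomic span is far longer than the 1959 bp transcript, which would be consistent with a gene model that contains introns.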
Annotation
Heliconius       HMEL005563         0.0      76.45%
Bombyx           BGIBMGA011192-TA   2e-126   55.48%
Drosophila       CG16791-PA         2e-61    32.48%
EBI UniRef50     UniRef50_D2A5P6    2e-148   50.27%   Putative uncharacterized protein GLEAN_15574 n=1 Tax=Tribolium castaneum RepID=D2A5P6_TRICA
NCBI RefSeq      XP_002426871.1     2e-122   39.05%   hypothetical protein Phum_PHUM283590 [Pediculus humanus corporis]
NCBI nr blastp   gi|270008954       9e-148   50.27%   hypothetical protein TcasGA2_TC015574 [Tribolium castaneum]
NCBI nr blastx   gi|270008954       1e-156   49.22%   hypothetical protein TcasGA2_TC015574 [Tribolium castaneum]
Group
KEGG pathway      nvi:100114326   7e-85
                  K12386 (CTNS) maps-> Lysosome
Orthology group   MCL17850        Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200883-TA
ATGCACCTCGTAACCCCATTATATATTAAACTGACCTGCGGCGGTTGCACATTCCCCCCCGAGTGGACTGGCAGCTGGTTCCAGTCCGGCGTCCCTGGTCTGATTTCTATCAACTCCACTCACATCCAGTCTAGAGGAGAGTGCTCGGAGACCGAGTCTTACGACAAGTTCCTTCTATACGACAGGGCCCTAGGCAACCTAAGTATATACGGAACACCTGGCCCTAAAACAATACAGCGAGTTTACGATCCATTGAAGACAAACGCGCACACGACAGCATTCATATACTGTACTAGCTCAGCACGCGGCGGTTGCACATTCCCCCCCGAGTGGACTGGCAGCTGGTTCCAGTCCGGCGTGCCTGGTCTGATTTCTATCAACTCCACTCACATCCAGTCTAGAGGAGAGTGCTCGGAGACCGAGTCTTACGACAAGTTCCTTCTATACGACAGGAGCTTCAATTGCTACAAATGCATGGTGATCCACGAGAAACACAAATATGTCCTGCAATATAAAGAGACATTCTGCAGTCCAAAGAACACCCTGTCGACCATATGCGAGGAGATAAGTGGTGATGCTCCACTGTACTCAATGTTCAGGAAGGAGCCGAGGCCGGAGCCGCAGCCCTGCCCCTTCCACCCCGCGCCCTTCACTTTCACGTACAGCAGGGGTTTAGGCGACTGTGTCTACCCTCCGTCGAGGGCTGAGTCCTGTACGGACGATTCGAGGCTTCTGTTGCGATATATGGCCTGCCCGGATGTACCCGGGACCGAGAGCAATGTCGAGGAGCTGGTGTGCCTGGCGACTTGGAAGGAAGGTTCGACCAGGTACCTCGTGGGTCAGATATCGCAGGTCCAGAGAAGAAACTCGATAGCATCAGACGAGGACACGTACAGATGCTTCATCTACAAGGGCCAACACGGTGAAAAAAGCACGTACATTATAGCGCAGTCGGGCGACGCGACTTGCAACGGACTGTCCTCGCCGACTGACGGGAGTCGGACCATGAAACTCACGACCAGCGACGACGAACACACACGCTGTCATTTCCCAAGTTGGATTGTGGAACATCACAAATGGTTCAGCCTTGATCACACTCATCAGTACCATTTCACTACAAAGAACGCGACGTTAAAGGCGAGTACACACACGAGAACTGCTACATTCCAAATATATGCCCATGGCCAAGTGATGAACGCCGATGGGTCGTTTGAGGAGAAGCGCCTCGTCTGCCACTCCATACTGGAACAGAAGGACAAGAAGCACATCAAACTGGTTGCTCACGTCACGAGAGGCTGCGAGTCGGGTCACGTGTGTCTGTCTTTCCACTACCGCGTCCCGGGCGCTGTTCTGGAGCTCCGCGGCGGTTCGTTGTGGGAAGCGCCCCAGGACGCCTGCAGCCAAAACGTGGACAGCCCATACGTCACACTTATCACGACATCTCTGACTCCGATGCAGTGTCCTATACAGGGGAGGTACAGCATAGTTGGCTCCCTGTCGGACAGACGACGCCGCAGACAGCAGTTACCCCAGGGTGACGAGGGCCCCCAAGAAGAGTGTGTGGCTGGTGACTACGACAGCATCAGCATCGGCTGCGGACATTCCCAGGACACGGTGGAGCTGAAATCCAGTTGCAGCAGCACCATGTACACATGTTACGGTGGCTGGCAGGAGGGTAGTCGAGGTTTCCTGGTAGCGGCGCCCGTGGCTAGGGGCTCTTCACATCCGCGGACCTTCTGCTTCATGTACACGCACCATAACGAATCTTCATCAGTGACCACATCCCTCAGTGCCGTCTCCAGATCCTGTGACAGATCAGCTATTCCGGGGAGACACGGAGATGCTGCCTACAATCTGACCAGCAACGGTACATGCCAACAAAGCAGTCAGTACAATAGCTCAGCGAGCCGAGCCACTCCGCTGGTGGCGTTGTTGGCTGCGATACTCACCGTCCGGTGA

Protein sequence:

>DPOGS200883-PA
MHLVTPLYIKLTCGGCTFPPEWTGSWFQSGVPGLISINSTHIQSRGECSETESYDKFLLYDRALGNLSIYGTPGPKTIQRVYDPLKTNAHTTAFIYCTSSARGGCTFPPEWTGSWFQSGVPGLISINSTHIQSRGECSETESYDKFLLYDRSFNCYKCMVIHEKHKYVLQYKETFCSPKNTLSTICEEISGDAPLYSMFRKEPRPEPQPCPFHPAPFTFTYSRGLGDCVYPPSRAESCTDDSRLLLRYMACPDVPGTESNVEELVCLATWKEGSTRYLVGQISQVQRRNSIASDEDTYRCFIYKGQHGEKSTYIIAQSGDATCNGLSSPTDGSRTMKLTTSDDEHTRCHFPSWIVEHHKWFSLDHTHQYHFTTKNATLKASTHTRTATFQIYAHGQVMNADGSFEEKRLVCHSILEQKDKKHIKLVAHVTRGCESGHVCLSFHYRVPGAVLELRGGSLWEAPQDACSQNVDSPYVTLITTSLTPMQCPIQGRYSIVGSLSDRRRRRQQLPQGDEGPQEECVAGDYDSISIGCGHSQDTVELKSSCSSTMYTCYGGWQEGSRGFLVAAPVARGSSHPRTFCFMYTHHNESSSVTTSLSAVSRSCDRSAIPGRHGDAAYNLTSNGTCQQSSQYNSSASRATPLVALLAAILTVR-
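
The protein record can be compared against a translation of the nucleotide record. The following is a minimal sketch, not the pipeline used to build this gene set. It assumes the standard genetic code and writes the stop codon as `-`, matching the trailing character of the protein sequence above. The function name `translate` and the `stop_symbol` parameter are hypothetical.

```python
from itertools import product

# Standard genetic code with codons enumerated in T, C, A, G order.
# Assumption: the standard code applies to this nuclear gene model.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, one residue per codon.

    Stop codons are written as stop_symbol. Codons with characters
    other than A, C, G, T raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    # First ten codons of DPOGS200883-TA.
    print(translate("ATGCACCTCGTAACCCCATTATATATTAAA"))
    # Last four codons of DPOGS200883-TA, including the TGA stop.
    print(translate("ACCGTCCGGTGA"))
```

The two fragments translate to `MHLVTPLYIK` and `TVR-`, which match the start and end of DPOGS200883-PA. Passing the full transcript to the same function gives the complete comparison.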