Monarch geneset OGS2.0

DPOGS207731
Transcript: DPOGS207731-TA, 2061 bp
Protein: DPOGS207731-PA, 686 aa
Genomic position: DPSCF300042, 1013473-1016394
RNAseq coverage: 22x (Rank: top 79%)
Annotation
Heliconius: (none listed)
Bombyx: BGIBMGA005293-TA, 4e-75, 35.92%
Drosophila: (none listed)
EBI UniRef50: (none listed)
NCBI RefSeq: (none listed)
NCBI nr blastp: (none listed)
NCBI nr blastx: gi|123456829, 2e-12, 19.83%, viral A-type inclusion protein [Trichomonas vaginalis G3]
Group
KEGG pathway: (none listed)
Orthology group: MCL30326 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207731-TA
ATGAATGAATCAAACGATGTCCAGGCTTTAGAGAAAATATCTTCCGAAGTAGATGAATCTCAATTCAAAATAATTCTAACTAATCAGTGTATCAAGGCTCAAGAGTTTTTAGTTCATAAACTACGTAGGCAATCTGAACAAACTAGACAAATATGTAATTTTATAAGTAGCGGCAATTTGAAATTGGAAAGTGAAATAAGACTTGGGGAGCAAGAAATAAAAAACTTACGCAGTGCTCTTGAAACCATAAAATCCTCAAATGTTGAGTTAGAAAAAGAGCTACTGCTCATTAAGGAAGATAAAATTGATTTGATAGAAAGAGTCAATAACGGAATTAGAAAATATGAGGAACTTTGGCTCACGTCCAAACAAAAATATGACAATATACCATTTATTAAGAAGCAACAGCTCACTATTAATAGAAGGAAGAGCCTTGAGGATACTGTTGTTTGCCTTGATAATGAAATACAACATTTGAGGAGAGTATTTGAGGAAAGAAAGAAGGAATTGTCAAATATCGACAGACAACACGCCATATATATTGCTAGATACATGGTGGATGAACGACCAGATGTCTTACGCTGTCTCACACAGAAGGCAGAAGAAGTAAATAATTTAATGAATGAAATAAAAGACCTTCAGAATGAACAAAACTTCAATTTTCCAAGAACAAATTATTCTAAGGTCGCAGCAGCAGTTGATGCAAAAACTAATGATGAAACTGTCTCAGTTGAAAAGAAGCATTCAGATGATGATAATGAAGATGCAATGCTGTTACCAAAGCTGCAGCTTCAAAATATTGATTTAGACATGCTTATGGATAAATTGGAACAGGTCAAGAAAGCTGATAGTATGTTAATTAAAAGAGCTGATTCTATTACAATACAAATTCAAGAACAACCACATAAATATGGACAAGAAAATATGAATAATATATATGTGTCTTCATATTTTAAAAATATCAATGATGTTAAGGAGGTAAAGGAAAAAGTTCAATCTCAGACATTCAATTATACAGACAAAAAGTTGATTCACATTCTAGAAGATATACAATTGCATAAGAAAGATTCTTATAACATATTTTCAAAGTTAAATCCAAATGAATTACGCATTGTTGATGTTATACCAGCAAGCATTGACATAAATAATAGTAATCAATCACCAGAAAAGCTTGATAAAGGGGCTGGAACAGAGGTTACTAAAGAAATTGACTTAACAAATAAGGATTCATCTAACATCGTAATACCTCCTTCACAATTTCTAGATTTGTCTCAGAACTCTCAAGAGAAAAAAGTAAAATTCAGTACAGTTGTGTCCGTTCAAGAGGTTGAGAATATGAATAAGTCGCAGGAATATGCTGTTGAAGAAAATATTAAAACTGAACTAAGCACAAGTGTTGCCAGTGAAAATAGTTACCAGATAATTAAAGAAAATATCTTCAAGAAACACAACATAGCTTTGTCACCTGAATTTATATATTCAAAAAATCCTAAAATATCTGATAAAAAAACAAAGTTTTGTGATGGAGACGACATGTCCGATAAGCCAACTCAAATTAAGGATGTGTGTGGTCAGAATATGAATACTGAAGTGGAAGTTGGAGCTAGTATTGATGTAGTGAGTAAACAGGAACCAACTGTCGCCGGTTTTCTGTTCACACATGGTCCGAAAGGTATACCTGACTCACTGGATGTTTCAATGGCTTCCACTTGCATTGAAGATGTAGACAATGAATTTCCTCATTGTTTTGATTCCAGCTTACTTTTGTCACCAAAGGCCGATTTAAAAGTGCCAGAGAACATTGTTAATAATACTGGTACTCTATCACAGGAAGTTCCAAACTTCTTATCAGGATTTAAAAAAGTGGGGTTGTCTTTGTTTGGACATTCTTCTGAGAATAATTCAGACACAAATAATAAGTTACCTAATGCTAATAATTTTAATTTTTCTTTTGTTAACACTGAAAAGAGGAATAGGGGCGGATTTTTCAACATTTCCAT
TAAAAAAATAGTTATTAATTATTGTTTTATTGATGTAACTGACATGCTCTTACAGCTATAA

Protein sequence:

>DPOGS207731-PA
MNESNDVQALEKISSEVDESQFKIILTNQCIKAQEFLVHKLRRQSEQTRQICNFISSGNLKLESEIRLGEQEIKNLRSALETIKSSNVELEKELLLIKEDKIDLIERVNNGIRKYEELWLTSKQKYDNIPFIKKQQLTINRRKSLEDTVVCLDNEIQHLRRVFEERKKELSNIDRQHAIYIARYMVDERPDVLRCLTQKAEEVNNLMNEIKDLQNEQNFNFPRTNYSKVAAAVDAKTNDETVSVEKKHSDDDNEDAMLLPKLQLQNIDLDMLMDKLEQVKKADSMLIKRADSITIQIQEQPHKYGQENMNNIYVSSYFKNINDVKEVKEKVQSQTFNYTDKKLIHILEDIQLHKKDSYNIFSKLNPNELRIVDVIPASIDINNSNQSPEKLDKGAGTEVTKEIDLTNKDSSNIVIPPSQFLDLSQNSQEKKVKFSTVVSVQEVENMNKSQEYAVEENIKTELSTSVASENSYQIIKENIFKKHNIALSPEFIYSKNPKISDKKTKFCDGDDMSDKPTQIKDVCGQNMNTEVEVGASIDVVSKQEPTVAGFLFTHGPKGIPDSLDVSMASTCIEDVDNEFPHCFDSSLLLSPKADLKVPENIVNNTGTLSQEVPNFLSGFKKVGLSLFGHSSENNSDTNNKLPNANNFNFSFVNTEKRNRGGFFNISIKKIVINYCFIDVTDMLLQL-
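
Length check:

The lengths reported in this record can be cross-checked against each other with a little arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes that the trailing "-" in the protein sequence marks a stop codon and that the genomic coordinates are 1-based and inclusive, neither of which the record states.

```python
# Values copied from the record; everything else is an illustrative assumption.
TRANSCRIPT_BP = 2061
PROTEIN_AA = 686
GENOMIC_START = 1013473
GENOMIC_END = 1016394


def cds_length(protein_aa: int, stop_codon: bool = True) -> int:
    """Coding length in bp: three bases per residue, plus an optional stop codon."""
    return 3 * (protein_aa + (1 if stop_codon else 0))


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


coding_bp = cds_length(PROTEIN_AA)
span_bp = span_length(GENOMIC_START, GENOMIC_END)
# Genomic bases inside the span that are not part of the transcript
# (presumably intronic, if the span covers exactly this gene model).
non_transcript_bp = span_bp - TRANSCRIPT_BP

print(f"coding length: {coding_bp} bp (transcript reported as {TRANSCRIPT_BP} bp)")
print(f"genomic span: {span_bp} bp, of which {non_transcript_bp} bp lie outside the transcript")
```

Under these assumptions the 686 aa protein plus a stop codon accounts for the full 2061 bp transcript, which would mean the transcript model contains no untranslated regions.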