Monarch geneset OGS2.0

DPOGS215408
TranscriptDPOGS215408-TA1380 bp
ProteinDPOGS215408-PA459 aa
Genomic positionDPSCF300088 + 464560-467391
RNAseq coverage166x (Rank: top 51%)
Annotation
HeliconiusHMEL0174209e-9843.79% 
BombyxBGIBMGA012366-TA4e-7437.70% 
DrosophilaCG1105-PA6e-4229.37% 
EBI UniRef50UniRef50_Q9VI539e-4029.37%CG1105 n=12 Tax=Diptera RepID=Q9VI53_DROME
NCBI RefSeqXP_001994718.17e-4130.86%GH17389 [Drosophila grimshawi]
NCBI nr blastpgi|1950556341e-3930.86%GH17389 [Drosophila grimshawi]
NCBI nr blastxgi|1583014922e-4028.07%AGAP001894-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
InterPro domain[17-162] IPR0110211.1e-26Arrestin-like, N-terminal
[16-162] IPR0147561.9e-21Immunoglobulin E-set
[190-317] IPR0110226.5e-13Arrestin-like, C-terminal
Orthology groupMCL25955 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215408-TA
ATGACCGCTGTCATGGGAACACAACGCGAAGAACTGCAGAGCGCAATTATAACATTAAACGAACCGAACGCTGTGTATTATTCTGGACAACTTATCAAAGGCAACTTGAATTTTGAACTGAATAAACCTCTTCATTATATTGCTATAAACATACAATATGTTGGGGAGTGTAATGTTTTCTGGATAGAGGAACAGATCGAAGTATATAATGGAGTGAAACAAAAGAAACACATAAAGTATGAGGGACGTGAGGAGTACTTTAACGTCATCCACTGTCTGAGCGGGGGGGATGGTGGTACATGTGTTCTGGCCACGGGACCACATTCGATCCCATTCTCCTATCAGCTACCATCCAACATTCCATCATCTTTTAAGGGTGATAAAGGGACCATCAGCTACAGTATTGTCGTTAGAGTGCTAATGACGGGATTCACTAACCAAGAGACCACCAAGGACTTTGATGTCGTATCACCCGCAGACTTGAATCAGGGCGGTGATAATATTAAGAAGCCAGTCATCCTGAATTTTGAAGAAACATCGAGTTGCAACCTTTTCTGTGTGACTAGGCCCTTGTCCGTGGAAGTGAAGCTGCCAGCATCCGGCTTCTGTCCCGGGCAGACGATACCCATCACAGTAGATATCAAGAATAAAACAAACTTGGAACTTTCTAAGATTGTCTTTGAAATATCTACAAAAGAGCGATATCGCAGCCTCCAACCGGTGTCAGCGTTCATACCTCCTGAGGACGTGTTAGTGTCTATTAAAAAAGGTCCCGTCCTAGCTAAAACCTGCAAAGAATATATGTGGGAGTTGAAAATACCAGAATTCATAGCTCCCAATTTAGAGAATTGCAGTATTATTGATGTGGGCTTCTTCTTCAAGGTAAAAATAAAGATGTCAGGTTGTATGGATGACATGTACGACGAGGCCGAGATCTGGTTGGGTCTGGTACCGTTGGGATCGTCCGGCGTGTCCTCCCACCCCCTGGCCGAGCGGCTGCCCATCGCAGCCATACCCCCCGCCACCCCTCCTCCGCCGTACGAATCACCACAGATGCCGCCACCGTACATACCAAATGTCCCGAACGTCCAGATCTGTCCCCCCGGACCCGTCCTCTTCCCTACTGTAGCCAACGTCGTAGATAAAAGCCTCGCTTACGGCTCAAAGAGCAGTCCTTTGGGTGCCTTCGAGATCGGCTTCCGACCCCCGGGAAATTCCTCGATGCCCGTTCCCAATCATCCATATCCGGATTTCGAAGATCAAATACATCAGAGGCCAGACTTACATCCATACCCTGAACCTGCGGCTTCTGAACCCTACTCCGGCCGCCCCTCCGCTCCACCGCCCCCCCTAAAACCTCATAGCCTATTAATATATTAA

Protein sequence:

>DPOGS215408-PA
MTAVMGTQREELQSAIITLNEPNAVYYSGQLIKGNLNFELNKPLHYIAINIQYVGECNVFWIEEQIEVYNGVKQKKHIKYEGREEYFNVIHCLSGGDGGTCVLATGPHSIPFSYQLPSNIPSSFKGDKGTISYSIVVRVLMTGFTNQETTKDFDVVSPADLNQGGDNIKKPVILNFEETSSCNLFCVTRPLSVEVKLPASGFCPGQTIPITVDIKNKTNLELSKIVFEISTKERYRSLQPVSAFIPPEDVLVSIKKGPVLAKTCKEYMWELKIPEFIAPNLENCSIIDVGFFFKVKIKMSGCMDDMYDEAEIWLGLVPLGSSGVSSHPLAERLPIAAIPPATPPPPYESPQMPPPYIPNVPNVQICPPGPVLFPTVANVVDKSLAYGSKSSPLGAFEIGFRPPGNSSMPVPNHPYPDFEDQIHQRPDLHPYPEPAASEPYSGRPSAPPPPLKPHSLLIY-