Monarch geneset OGS2.0

DPOGS206459
Transcript        DPOGS206459-TA   3525 bp
Protein           DPOGS206459-PA   1174 aa
Genomic position  DPSCF300070 - 41999-45523
RNAseq coverage   526x (Rank: top 24%)
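
The lengths and coordinates in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_lengths` is hypothetical, and it assumes the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
def check_lengths(transcript_bp, protein_aa, start, end):
    """Return (genomic span in bp, leftover bases, codons minus residues).

    Assumes start/end are inclusive coordinates (an assumption, not stated
    in the record).
    """
    span_bp = end - start + 1          # inclusive span on the scaffold
    leftover = transcript_bp % 3       # 0 if the transcript is a whole number of codons
    codons = transcript_bp // 3
    extra_codons = codons - protein_aa # 1 is expected if a stop codon is included
    return span_bp, leftover, extra_codons


# Values copied from the record: 3525 bp, 1174 aa, DPSCF300070 41999-45523.
span_bp, leftover, extra_codons = check_lengths(3525, 1174, 41999, 45523)
print(span_bp, leftover, extra_codons)
```

Under that assumption the genomic span equals the transcript length, which is consistent with a model that has no introns in this interval, and the single extra codon matches the stop codon at the end of the transcript.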
Annotation
Heliconius      HMEL012934              0.0     75.95%
Bombyx          BGIBMGA005470-TA        0.0     64.85%
Drosophila      su(s)-PA                2e-42   37.45%
EBI UniRef50    UniRef50_UPI0002246FC7  9e-71   40.35%  UPI0002246FC7 related cluster n=1 Tax=unknown RepID=UPI0002246FC7
NCBI RefSeq     XP_001660611.1          2e-58   43.32%  hypothetical protein AaeL_AAEL010074 [Aedes aegypti]
NCBI nr blastp  gi|340726374            7e-73   37.80%  PREDICTED: hypothetical protein LOC100649901 [Bombus terrestris]
NCBI nr blastx  gi|322784362            3e-127  31.02%  hypothetical protein SINV_01065 [Solenopsis invicta]
Group
KEGG pathway 
Orthology group  MCL18837  Insect specific

Nucleotide sequence:

>DPOGS206459-TA
ATGGCAGATCCAGCTACAGTTACAGAAGATTTAGAAGATGGTGAGATTGAAAGCGATGGGGAAGAAACAAATGAAACACAAAATGAAGAAAAGCCGGAGGAAGCAGCTGCAGTTGTTGAAGAAAATAAAGCAAAAGTAGATAGTCAAAGTTTTTATTATTCTAAGTCAGGTAAGAAGAAGAAGTCAAAATCAAATAAGAGCGAGCAGAAGGTTAAAGATAAAAGTAAAAAGAGTCGGAAGTATGCGGAAACGATTGAGGATGACTTTGCCGGTGGGATAGAAAAAGCAATAAGAAAGGCTATGAACAAAAACGATGGCAACAATTTTCAAGGTGAAGCTGAAGGCGAAGAACGTAGGAAATATAAAAAGAGAAAGAAATACGATGGAAAGGAGGGGTCGAGTAAGAAGCGCAAAAAACAGGAGTATTCCGATGAAATGGATGAAGCGGAAATGATGTGCGTTCGCGGAGCTTCGCCGGTACAGAAACAGTCTCAGGATGAAAGCTATCAGGAACAAGACTCGTACGAATCTGATAGCAGCCAAGATTCACGAGGACACCAGCAGCATCGCCAGCAGAGGCATCGACCACCACAGCGGGAGAGAAATAAAAATAATAAAAATGATAGGAGGAGGGGGGGAACACATCCCATGCAAGATCCCGATGGTGTTTGTCTGTATTACATGCAAGGAAAATGTCACAAAGGGGACGATTGTGTGTACTCTCACGACGCACAGCCGCCGAGGAAAATGGAACTATGCAAATTTTATTTAATGGAATGTTGCGCCAAAAGAGATAAGTGTCTGTACATGCACGCTGACTTCCCTTGCAAATACTATCACACTGGGCTTCCATGTATTTATAAAGACGAATGTAAATTTGCGCACGGTAAACCTTTAAGCGATGCACTCAAAAATATTTTATTGAAACACATCGAATCAGCTCCTAAGGAAATATTAGGTGACTTCCCAAGACTCAATAGAGATGGAGCGTTAAAAATGTTACAAAATACACAACTCAAACTAATGCAACAATATTCAGAGAGCACGGACGCTGAAATAAAGAATATTCCGTCACTCTTCGATATTAACATACCTAATCCTCAACTAAATGTAGATTCGTCTCAGAATAGTTCTTTTAACAACGAAAGGCAGAGCAAGGTATCACCTAAAGTAAGGCAGTCGAGGTGGCAAAATGACGAGTCGAATCAAAATCAATGTCATTCACCAAACAGCAATGCAAGTTCAAATGTTCTAAGTATTAAAAACTTAACCGGTGTATTATCGCCTCGACAAATTTATGAACTCACAAAGATAGGTATCGAAAACTTAGATCAATTGAGTCAGTTAACAGTATTACAGTTGAATAACATCGGTATATCCTTAAAACAAATCTCTGACATACAACTGAACACAATGAGCATCCAGAAATTAGGTTTAATTAGCAATACTGAACAACAATCCCCACATCATCAATTCGTTAGTACTAATGTTCCTGCAAAGGATTTAGATTTAAGAGTACCGCCCGCAGCACTGCCTTCGTCGAATAGTTTAGCTATAGCATCCGAATTACCCGGTCAAGACGTTGATATGCGTTTTCAGCCGAACACACAATCACTTAGTATCAAAGAAGAGTCGAGCAATAAAACATCTAAAAGAGAAACAAGCAATAAAGACATGATTGATATAGATCAGTACACAAAAGATGCGTTAAAATTTGCATCAAAAGATAAAGAAAACATAGACATTTCAAACGAGACTGTAGACACTGAAAAGAATAGTACTTCAAATGAAATAAAAGATGTAGATCACAGGGTCATTCCATTTTCTGATGGCGGGTCGAGTAGTCATAATGCTAATATAGAAGAGAATACTCTGCGTTCGGAGGCGGATACGGACATACGGTTTCTGCAACCGGATCCTATATTTAAAAATGCTGAGAAGAAACATCGTCGTTCGACTACGGACGACGATGAAGATAACAATTTACTGATCGACGAGAA
ATGGTATTCAAGTGACGAAGAAAAGGGAAATAAAAAGAAATCACCCATATCCTCACCTCAGAAGTCGTTGATGTCACCGCCGCCGGTGGTTCCGCCTGTTATAGAACCTTCGTCGGTTTTGAGTAAACTTGGTGATCTATCAAAAATAGATATCAGCGAAGAAGTTACAAAGCTGCTCAATACTATGAAACATAATTTACACGAAACTCCAAGTCAAGAAACTCAAGAGCCGACGATATCAAGAGATCCTCGTTCTAGAAGATCCCCTCCGACGTCTGTGGACACCAGTGTAGAAACAAGTAAAACGACGAGTAAGAAGACCAATCGAGTGTCTATATACGAATGTGTTGACGGTGAACCCACGGATGGGCGTCGAAGAACGGACGTGGATCTCCGGACGACGGATTTCAAAGGCCCCAGCTACGGTGACACAGACTTGAGACAAAACACCAGCGGTGACATAGATTTACGTCTAGGCTTACCGTTCAAACCGATACCTAACTACACGCCAGCCTCAGAAATAAACGGTTCAATTAACAGTCATCCGCCGATACATTACAAACTAGTTGCCATACACATACCGCGTCCTGATTACACGGATATCAAAAATAGCACAGCGAAATCACAAGCACTGACGGATCCTAGGTTAAGGAAAGTATTCAGATTGTCCGTAGAAGAAACTAATAGTGACAGTGAAAAACCAGCGAAAGTTGTAAATACAGGGCCCCGAGTGGACCCCAGACGGAAACCTAAAGATCAAATCGATACCACTAGTCAAGAACAAAAATCTAACTCATTAGAATTACAGACGATATTACAAAATTCAAATTGGTACAAAGATTTGAGTTCCACACAGAAGATATTCGTGAATCAGAATCTAGCACCGGTGACGCAGATGATAAAACAACACCATCAGGAAAAGCAAATGGGTAAGAAATTTGATATAGGTTCAATACAAAACAATAACGTTCTGTGTAGCATATTCACTAATCTAGGCGTGACTCTTGGAGAAAACGGTGAGTTCTCGTATTTACCTAAACCGAAGGAGGCTCTGCTGAAGACTCCGATAGGCTTTAGTCAGAATTCAAATCCTTTCGGCATGAACAACATGTCCGGAGGCCATGGGCCTATGGAGGGTAATATTAATATGATCAATATGCCGCCAATGGGCATGGCTGGTGTAGGAAACATGTCTAATATGAATTCATTGCACGGCTTCAACCAGCCTATGTCGGACCCCAGAGGTGGGCCGACTCCGGGCCTGCTGGGCATCGCGCCGAACATACCTCATAACTTTAACAACAACAAATTCGGTGGTCCACATAATTTCGGTAACATGGGATTCAACGGTCCGCCTAACGATTTTAATTTTATGGAGGGAGATCAGAACTTCCAAAGATTTCCCAACAGAGGAGGTCTCCGCGGCAGAGGGAACAATGATCGCTGGAACAGAGGTGGAAATAGGGGGCATAGGGACAGGAAAAATTTCAACGAGCGAGGTAATTGGAAAAACGACAGGCATTAG

Protein sequence:

>DPOGS206459-PA
MADPATVTEDLEDGEIESDGEETNETQNEEKPEEAAAVVEENKAKVDSQSFYYSKSGKKKKSKSNKSEQKVKDKSKKSRKYAETIEDDFAGGIEKAIRKAMNKNDGNNFQGEAEGEERRKYKKRKKYDGKEGSSKKRKKQEYSDEMDEAEMMCVRGASPVQKQSQDESYQEQDSYESDSSQDSRGHQQHRQQRHRPPQRERNKNNKNDRRRGGTHPMQDPDGVCLYYMQGKCHKGDDCVYSHDAQPPRKMELCKFYLMECCAKRDKCLYMHADFPCKYYHTGLPCIYKDECKFAHGKPLSDALKNILLKHIESAPKEILGDFPRLNRDGALKMLQNTQLKLMQQYSESTDAEIKNIPSLFDINIPNPQLNVDSSQNSSFNNERQSKVSPKVRQSRWQNDESNQNQCHSPNSNASSNVLSIKNLTGVLSPRQIYELTKIGIENLDQLSQLTVLQLNNIGISLKQISDIQLNTMSIQKLGLISNTEQQSPHHQFVSTNVPAKDLDLRVPPAALPSSNSLAIASELPGQDVDMRFQPNTQSLSIKEESSNKTSKRETSNKDMIDIDQYTKDALKFASKDKENIDISNETVDTEKNSTSNEIKDVDHRVIPFSDGGSSSHNANIEENTLRSEADTDIRFLQPDPIFKNAEKKHRRSTTDDDEDNNLLIDEKWYSSDEEKGNKKKSPISSPQKSLMSPPPVVPPVIEPSSVLSKLGDLSKIDISEEVTKLLNTMKHNLHETPSQETQEPTISRDPRSRRSPPTSVDTSVETSKTTSKKTNRVSIYECVDGEPTDGRRRTDVDLRTTDFKGPSYGDTDLRQNTSGDIDLRLGLPFKPIPNYTPASEINGSINSHPPIHYKLVAIHIPRPDYTDIKNSTAKSQALTDPRLRKVFRLSVEETNSDSEKPAKVVNTGPRVDPRRKPKDQIDTTSQEQKSNSLELQTILQNSNWYKDLSSTQKIFVNQNLAPVTQMIKQHHQEKQMGKKFDIGSIQNNNVLCSIFTNLGVTLGENGEFSYLPKPKEALLKTPIGFSQNSNPFGMNNMSGGHGPMEGNINMINMPPMGMAGVGNMSNMNSLHGFNQPMSDPRGGPTPGLLGIAPNIPHNFNNNKFGGPHNFGNMGFNGPPNDFNFMEGDQNFQRFPNRGGLRGRGNNDRWNRGGNRGHRDRKNFNERGNWKNDRH-
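
The relationship between the two FASTA records above can be illustrated with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1) and prints stop codons as `-` to match the trailing `-` in the protein record; the function name `translate` and the two short test fragments (the first 30 and last 9 nucleotides of DPOGS206459-TA) are chosen here for illustration and are not part of the original page.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3   # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


first_30 = "ATGGCAGATCCAGCTACAGTTACAGAAGAT"  # start of DPOGS206459-TA
last_9 = "AGGCATTAG"                         # end of DPOGS206459-TA

print(translate(first_30))  # MADPATVTED
print(translate(last_9))    # RH-
```

Both fragments agree with the corresponding ends of DPOGS206459-PA. Translating the full transcript the same way requires joining the sequence lines first, since the nucleotide record is split across two lines.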