Monarch geneset OGS2.0

DPOGS210059
TranscriptDPOGS210059-TA5964 bp
ProteinDPOGS210059-PA1987 aa
Genomic positionDPSCF300017 - 865743-905060
RNAseq coverage154x (Rank: top 53%)
Annotation
HeliconiusHMEL0133580.073.76% 
BombyxBGIBMGA012699-TA3e-10279.45% 
DrosophilaCG2083-PB7e-16547.90% 
EBI UniRef50UniRef50_E2A0C10.057.24%Protein TET2 n=5 Tax=Formicidae RepID=E2A0C1_CAMFO
NCBI RefSeqXP_396330.30.056.45%PREDICTED: similar to CG2083-PA [Apis mellifera]
NCBI nr blastpgi|3800294960.056.63%PREDICTED: uncharacterized protein LOC100866593 [Apis florea]
NCBI nr blastxgi|3454872430.033.12%PREDICTED: hypothetical protein LOC100114438 [Nasonia vitripennis]
Group
KEGG pathway 
Orthology groupMCL20720 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210059-TA
ATGAACGAAAAGTACTCAACTTGCTTCCCTTTGCAAGTCGATAGTCGTGAGAGCGGGACGGCGGCACAGGCCGTCTCCGTCACACATTCGAGGATGACACTTCCGAGAAACAGCTACCGTGGTCGCAAGCCTCTAGGAGCGGTAGTTCCTGCGGGACTGAGCCCCGGTGTGCCCTCGTCCGCTTCCCCCGGGCCCGCCCCGGCCGCCCCCAGCCCTGCACCAGCCACCACGCCCGCTATGAAGCAAGACCATCCTAGCCAGCCCATGGCTCCAATGCCATTTTACGCGGGTGACCCGAACAGATTTCCTACAACGTGGCAAACGGATCCGTCGCAAGGTTGGCAAAACCAATTTATTCAGCAGCTGCCAACGCAGACGGGGCAGACGACTCTAGACTACCAGGGCAATCATACTCCGTACGCCACTTCCCCCCACTATCAGACCAACACAACGTACACCGTCCAAAATGTTGCAACTACATTCGATGTCACAAACAACACGTATTATCAAACTGGCGTTCAAGCTGTTCCAGCACAACGCCCGCCATCGAACCGCGGCGCTTATACTCCGGTGCCATCTCCCCGGGCTCCACACTATACAACAGAGTATCAACAACAATACGCACAGCAGGCCCCGACTACATGCGTCACCACGGGGAACGACTCGAGACCCGCCTCAGCTGCATCCTATCAAAGTTCTGTTCCAACCTCATCTGCGTCGGTATATACTGTGAGTCAACAAAGTTACGACCGCAACGAATACACCTCGGAGCATAACGACGATTATAACAACGGGGAAAACAGCACAGGGGAAGCGAGTAACGATGGAGAAAGACAAGAACAAAATGCGTCAAACAATCAGCATTCGGGAAACGCCGAGCCTGGATACCTGCAGGGGAGCAACGGTCCGCGAGCTTCACTCGATTGTAATGGGTATCCGGGCAGCTATGCGAGCGGGCAGTACACGGATAATGGACAGACGCAGAACCAAAATGACTGGCAACAACGCCAATGGCAACATAACCAAAGAATTCAGAATCAACAGTTACAGAATCAACAGCAGTTACAAAATCAACAACAACTACAAAATCAACAACAATTACAAAATCAACAGCAATTACAAAATCAGCAACAAATGCAAAATCAGCAGCAGTTACAAAATCAACAGCAGCAACAGCAGTTACAAAACCAACAACAACTTCAAAATCAGCAACAAATACAAAACCAACAGCAAATTCAAAACCAGCAACAAATGCAAAATCAACAGCAAATGCAAAATCAACAGCAAATGCAAAACCAACAACAAATGAATCATATGCAAAACCAACAACAACAATTGCAAAATCAACAACATCAGTTGCAAAATCAAATACAAAACCATCAAAATCAACAAATGCAACAGCATAGACAAGATGAATCAGAGCAACAAATGTTCTCACAATCGGACAGGGTCAATTTAAATTCGCGACTTAAAACAATGATTTTAAATAAACAAAGTCATATACGAGACGGCACAAACACTCCGCCAGATAAAGGTGACCCTGGAGAGGGGCCTCCTCGTGTATTAGATGATAGAAAAACCGGTGTCAACGACGATAGAAATACCACCGGTCATTTTTTATCGTATAGCCACCATCTCCGAGGTAATTACCACATAGGTGAACAGGGTTACCCTGTCACTGGTAATTATTCACAAAATACTCAGGCCTATGGCTTGAGTGATATCGGAGGTGGTGGCCACCATGTATGGGAGGGGGGAACGCAAGTTCCGAAAGGTTATAAGCAAAGTATATCGAAGAACGTATCTAAAACTCCACAAAATTTACCGTCACACGCTAACGTAATAGGACAATACAGTACATATGCTAACACGCCTTATGAAGCACAAAAATATGCAGACACGTACGGTAGTGAGAATTTAAGTTATAGTCAACAAGACAACAAAGATTTTCATTCTAAGATGCTCATTGGACCAGTGTCAGATATCAACCAACAAACCTGCTCCACGACTCAAAAAATCAATACACAAAAAGGAACTTATGAAAGAGAGACCAGTGATATTAAGTATAGGCAAGCAGCGGCGCCTTTAGTCCCTAAGTTAGAAATACCAGACAATTATCAATATGTGAAGGAGGCTATTAAAAGTGAACCACAACGATCGATGAACCAAAATGCATTCGGTAAGCAAATGGAAGGGCTAAAACATTTGCCTTATCCTAATGGGACACAAAATATGGCATTACCACCAATATCTACAATAAAACATGAAAATTTAAATCATAAACCACATGTCTATCAAAACTTTCAAAATATACCTTACGAAAGGAAAATTAATACAAGCGAAGACTTCGCTCAAATTCCAAGGATGCAAGAAAATATAGCTTTTCAAAAGTTCAGTAAAAATTTTGAATCCAAATTTAATCGGGGTGTAGACTGTCGAAGCAATTCAGGAAAAAATATAGACATTGATTTAAAACCTCCATATTATATAGAACAAATAAAGTCCGAACATTCACCTCCTGGACATAAAATATATAAGAATTTGCTCTATGGACCCCCACGGAGTGAGCCTTACATGTTCGCAGGAGAGGGAGGCCCTAATGCGTTCAGAAATGAAATAGGGTATGCGTGTTGTAGACAAGGCTCAGTAAAAAAACCTCCCCCAGAACATCTAAGGGATGGAGCTTGTGCAGGTCTGCAAACTAAAGACGAAATACTAGAAGAAGACCCAGACAGTACAGATAATTCGAAGACTCCATCGAAACCTGGTACTCCTATATCTGATCTCTTCCCAAAAACAACGAAAGAAAACCAATTTAATTACTCGAAGGAATATTTAGATAACTTAGAAAGATTAAAAAATAATTCCAGAACAGAAGTACCAGACTGTAATTGTTTTCCAGCTGACAAAAATCCTCCTGAGCCTGGAAGCTACTATACTCACTTAGGAAAAGAAGGCAAAACAGCTCAGGGTTGCCCGATGGCTAAGTGGATAATACGGCGTTCCAGCTATACTGAAAAAGTACTAGCCGTAGTTAAGTTCCGCAATGGTCACAAATGCTCCACTTCCTGGATAGTAGTTTGCTTAGTGGCTTGGGAAGGGATACCGCAGTCCGAAGCTGATTTAGATTACACGCTGTTGTCCCACAAACTGAATAGGTATGGCTTGCCGACTACACGTCGGTGCGCTACTAATGAAAATCGAACCTGTGCTTGTCAAGGGCTCGATCCGGAGACGTGCGGCGCGTCCTACAGCTTTGGCTGCTCCTGGTCCATGTATTACAATGGATGCAAATACGCCCGTTCAAAAACTGTCCGTAAGTTTCGACTTTCGGTCAAAACAGAGGAGAGTGAAATCGAAGAACGCATGCACGTCCTGGCCACCCTACTTAGCCCCTTGTACATGAATCTTGCTCCAAAGTCATTTGAGAATCAGTGTCAATTCGAGAAAGAAGCGTCGGATTGCCGTTTGGGCTTTAAGCCTGGAAGACCGTTTTCAGGGGTAACGGCATGTATAGACTTTTGCGCTCATGCACACAGAGATCTTCATAACATGAACAACGGCTGCACAGCAGTAGTCACACTCGCCAAACATAGAGCACTCACGAAACCCAATGACGAACAGTTACATGTCCTTCCATTGTATGTTTTGGACACCACAGATGAATTTGGGTCTAAGGAAGGTCAAGAAGAGAAAATTGCCAGCGGGGCTTTGGAAATTTTGGACAAATATCCTATGGAGGTCCGCGTTAGGTCCGTCCCACTGCAGCCCTGTCGCCGGCATGGGAAAAAGAGGAAAGACGATGAAAATACTGAATCGAACAACACATCAACAAACACTCCACAACAAAGTCCAAATAACCAAAAGAAATCACAATGTGCCACGCCGACGCCGAAGACTCCGCAACCCAATGAAATTAGAAACAACGCGAGTCCACGAAGTCAGAGCAGTGCTTCACCGAGAGCCAGCACGCCCGCTATGAATCTGCAAGTTAACACTGGCTTCACGAATACTCCACACTATAACTCCGCTTTCGTGAATCCAAATGTACACAATTCCAGTCTCATGAATCCAAACCTCGCGAATCCAGCATTACTCGATATGGCGACAATGATTGATAATTTCACTGATGCTCAGTTACAGAGCAATCAGATCTCTAGCACTGTACTGGATTCTCCGTACAACCCTTACGATCAAAGCTATAGTTTACATAATCAAAACGCTATGTATAACCCACACAACGTCGCTCCTGTACTAGATAACAATTGCCAAAATCAAAATCCTAACCAGCTGCCGAGTTTTACTCCAAACTTACAAAAGAGAGACCAATGCAATCCGCAGAATATAAATCAAGTGAATGCGGCTCCACAACAGTGGCCCGAATTCGATAATTCAGAAATGTTCAGCAATGTGGTGAAGGAAGATCCAGATTCTACAACTGCTAATTTAAATGCTGCTAATAATTTTAGTCCCGATTCAAATCGAAGTGAAAGCAATTCAGAAAATATATCCAGAAGCAATACGGAATCCGAAAAGCTCAATGCTTTAGGAGATTTGACACCAAAATTACCAGAATTTGATATGCCCAGTTCACCAAGGTTATCTGAAATATCTCAAAAAACCCAACGAACGCCCGAATCACCGGTGCCTTCACAAGAACTTAAATCCCCTAATCATGACTCGGTTAATTCAGACGTTCGACGAAGTGTATGGAGATCTCCAGACACAAAAAGCTGGGACCCGATGAAACCTAGAGACCAAGTCACTGAAAATGAGAATGCATTGTTTAGAGTGCCTAAAGCGAGGCCTCCATCGCGATCGCAATATGTGAATCCTTCCGATGGCATGGAAAACTCTACGTTTCTTAAACCGTATCCTCCATCTGAACGATCCTACCACAACCAAGGTTATGAAAACAGAAGTCTCCAAGAGAATAGAATGAACGCGTTACCGCCGACGTCACAACAACAAATGAACTACACCATCAATCATGAGGACTCACAGAATATAACAACGAATAAACCCGCGCCGGAGTTCACGGGAAATAACTTTCAACCGATAAACCAAACTACTTACAGCAATCAAGTGCCAGTCCAGGGTTATGGAAACTATCCCACGATGACCAACGCTTACTCTTTTACGCCGAACCCCTATGGGCATTTCAACCCGTATGAAAACTCTTACAACGACTACAATAGTATAGCTTATTATAATGCCGAAAAATACAAAAGAGAAGAATTAATACGAAATTCCCATTGCTATAATTCATATGGATATCAGTACAATCCAAATTTTGCAGCAAATTTCTACCAGAATCCAAATCCCAATTGGTGTCAGTCTTCGCCCAACTGGTGCCTGTACCCGCCACCTTTTTCTATTCCATATCCGCCCGAACCACCAAAAGCAGAACCGATCGGGGAAGTTACTGGAATTAATGACAATCTGGAGTGTTTTAAAGATAGTCAAATGGGAGGAGTTGCTATCGCTTTAGGGCACGGGAGTGTATTGTTTGAATGTGCGAAACATGAGATGCACTCCACCACCGCGGTGAAGAGCCCGAACAGAGTGAATCCAACGAGAATATCACTCGTCTTTTACCAACATAGGAATCTGAATCGTCCAAAACATGGCCTTGAAGAGTGGGAAGAAAAAATGAGATTGAAAAAGTTAGGCTTAAGCCCGTCGACACCGGGAACCCCGGGAGCCAGCTCGAACACCGCGTCACCGGCCTCCACACCCGGCCTCGAGCAGAGACAGACACCAGCCGAGAAATGGAAAAGCAACGAGAACGCGATACCAGGAGGATATAGCTCTTTGGCGGCTCTCGTGGAGGCGACCAACGCGGCCAGGAGGAGTCGCCAGATTATGTTGAGGACGGACACGCAGACCACCATGTCGTGGACCACGTTGTTCCCGATGCACCCGTGTACTGTGACGGGACCCTACCAGGAATCCGGCACCTGA

Protein sequence:

>DPOGS210059-PA
MNEKYSTCFPLQVDSRESGTAAQAVSVTHSRMTLPRNSYRGRKPLGAVVPAGLSPGVPSSASPGPAPAAPSPAPATTPAMKQDHPSQPMAPMPFYAGDPNRFPTTWQTDPSQGWQNQFIQQLPTQTGQTTLDYQGNHTPYATSPHYQTNTTYTVQNVATTFDVTNNTYYQTGVQAVPAQRPPSNRGAYTPVPSPRAPHYTTEYQQQYAQQAPTTCVTTGNDSRPASAASYQSSVPTSSASVYTVSQQSYDRNEYTSEHNDDYNNGENSTGEASNDGERQEQNASNNQHSGNAEPGYLQGSNGPRASLDCNGYPGSYASGQYTDNGQTQNQNDWQQRQWQHNQRIQNQQLQNQQQLQNQQQLQNQQQLQNQQQLQNQQQMQNQQQLQNQQQQQQLQNQQQLQNQQQIQNQQQIQNQQQMQNQQQMQNQQQMQNQQQMNHMQNQQQQLQNQQHQLQNQIQNHQNQQMQQHRQDESEQQMFSQSDRVNLNSRLKTMILNKQSHIRDGTNTPPDKGDPGEGPPRVLDDRKTGVNDDRNTTGHFLSYSHHLRGNYHIGEQGYPVTGNYSQNTQAYGLSDIGGGGHHVWEGGTQVPKGYKQSISKNVSKTPQNLPSHANVIGQYSTYANTPYEAQKYADTYGSENLSYSQQDNKDFHSKMLIGPVSDINQQTCSTTQKINTQKGTYERETSDIKYRQAAAPLVPKLEIPDNYQYVKEAIKSEPQRSMNQNAFGKQMEGLKHLPYPNGTQNMALPPISTIKHENLNHKPHVYQNFQNIPYERKINTSEDFAQIPRMQENIAFQKFSKNFESKFNRGVDCRSNSGKNIDIDLKPPYYIEQIKSEHSPPGHKIYKNLLYGPPRSEPYMFAGEGGPNAFRNEIGYACCRQGSVKKPPPEHLRDGACAGLQTKDEILEEDPDSTDNSKTPSKPGTPISDLFPKTTKENQFNYSKEYLDNLERLKNNSRTEVPDCNCFPADKNPPEPGSYYTHLGKEGKTAQGCPMAKWIIRRSSYTEKVLAVVKFRNGHKCSTSWIVVCLVAWEGIPQSEADLDYTLLSHKLNRYGLPTTRRCATNENRTCACQGLDPETCGASYSFGCSWSMYYNGCKYARSKTVRKFRLSVKTEESEIEERMHVLATLLSPLYMNLAPKSFENQCQFEKEASDCRLGFKPGRPFSGVTACIDFCAHAHRDLHNMNNGCTAVVTLAKHRALTKPNDEQLHVLPLYVLDTTDEFGSKEGQEEKIASGALEILDKYPMEVRVRSVPLQPCRRHGKKRKDDENTESNNTSTNTPQQSPNNQKKSQCATPTPKTPQPNEIRNNASPRSQSSASPRASTPAMNLQVNTGFTNTPHYNSAFVNPNVHNSSLMNPNLANPALLDMATMIDNFTDAQLQSNQISSTVLDSPYNPYDQSYSLHNQNAMYNPHNVAPVLDNNCQNQNPNQLPSFTPNLQKRDQCNPQNINQVNAAPQQWPEFDNSEMFSNVVKEDPDSTTANLNAANNFSPDSNRSESNSENISRSNTESEKLNALGDLTPKLPEFDMPSSPRLSEISQKTQRTPESPVPSQELKSPNHDSVNSDVRRSVWRSPDTKSWDPMKPRDQVTENENALFRVPKARPPSRSQYVNPSDGMENSTFLKPYPPSERSYHNQGYENRSLQENRMNALPPTSQQQMNYTINHEDSQNITTNKPAPEFTGNNFQPINQTTYSNQVPVQGYGNYPTMTNAYSFTPNPYGHFNPYENSYNDYNSIAYYNAEKYKREELIRNSHCYNSYGYQYNPNFAANFYQNPNPNWCQSSPNWCLYPPPFSIPYPPEPPKAEPIGEVTGINDNLECFKDSQMGGVAIALGHGSVLFECAKHEMHSTTAVKSPNRVNPTRISLVFYQHRNLNRPKHGLEEWEEKMRLKKLGLSPSTPGTPGASSNTASPASTPGLEQRQTPAEKWKSNENAIPGGYSSLAALVEATNAARRSRQIMLRTDTQTTMSWTTLFPMHPCTVTGPYQESGT-