Monarch geneset OGS2.0

DPOGS212820
TranscriptDPOGS212820-TA5991 bp
ProteinDPOGS212820-PA1996 aa
Genomic positionDPSCF300086 - 325567-338350
RNAseq coverage222x (Rank: top 45%)
Annotation
HeliconiusHMEL0081960.074.15% 
BombyxBGIBMGA000766-TA0.081.47% 
DrosophilaCG3173-PA0.027.77% 
EBI UniRef50UniRef50_UPI00021A77F80.034.40%UPI00021A77F8 related cluster n=3 Tax=unknown RepID=UPI00021A77F8
NCBI RefSeqXP_394944.30.033.73%PREDICTED: similar to CG3173-PA [Apis mellifera]
NCBI nr blastpgi|3287784360.033.75%PREDICTED: integrator complex subunit 1-like [Apis mellifera]
NCBI nr blastxgi|3407223400.034.26%PREDICTED: integrator complex subunit 1-like [Bombus terrestris]
Group
KEGG pathway 
InterPro domain[309-391] IPR0221457.5e-17Protein of unknown function DUF3677
Orthology groupMCL12994 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212820-TA
ATGGAACGCGGTAAAATGTCGTCGACCGGCCGCGGTGGAAAGAGTAAAGCACCTCAGCATCCTCAGGATATATTTGCCTTAGGCAGTAAATCCACAGTATCAGTATCCAGGGACTCTGAAAAAAGAAATATACACAAACCGTCCACAAGTAGTGTTGGTGAAAGAAAGAGGGGAGAATCCTCAGGTTTTGGAGGCCAACCTGCAAGTAAACGGGCCAGACTTGGTTCCCCTGCCGAAGCCGTCACTGGCACCTCATTAGAAGTTGACCCTGTAGATCTTGTGCCAAATGTGTTACAAGCTCTTGATACCCATAACTCTGATAAATTGCTGGGACTTCTAACAGGATCAATACGTCTCTTAAAATCTCAGAGGTCTAAACCAGACCCTATATTATGTATGAGTATGCTCTACTTGACAAAGATACGTCCAAACATGTTTGCTCATGAAACAGTAACACAATCCCTGTGCACATTGCTCAAAAGGGAACAAGGTGCAGCTTTCAAGAGCAAAGGGAATCCTTTGGTGTTTGTACTGGCTTGTAATATGCTGTATGCTGGTCATAAGGATAGTAATAACTGGCCAGACACATTCATTAAGGTATATATAGAAGATGCTCTGAATGAACGCTGGTGGGTGGATTGCTCCTGGTGCAAATGCTTGGTGGAGAATATCCTCACAGCCTTTGGCACTAAACAGCCTCCCGCACATCTTATACCAACAGATAACAGTCTAGGTACAATGTCACCTTCTGGTTCGGGATCTCCTCTCATGGGGACCAATGAAGAAGACACTGACAACACTGACCTGGAATATTCAGTCTTCCCCAGGTACTCCAGCAGCTACGAGTCGGTGGAGGCCCTGGTACTGGAGGCCATCAAGGAGCAGATTCAGCGCCGAGCCGCGCCGGATGCCATCGGCAAGGGCTTCCTCAAATTACTGTCAGCCACTTGTGGTTTTCCAGAGATTAGAATGATAGCGGCGTCCCGGCTGGAAGCGTGGCTACACTCGGGCAAGTTGTGGCGCGCGGCCCAGGAACTCCTCGCTCATGTTTGCGCCAACGCGGCCGCCTCCGGGCCCAGCGCCGCTCGCGACCACGAGGTCCTCGCTCAGTTGGCCCGCATGCGCCTCAAGACAAAGCCGCTCCAGGCCGCTTACCAGGCCTGCCTGCGAGACATGGTCGCCGACAGCCCAGCGCTGCTGCGCTCCGTCGTCACCCACACCATCTACAACGAACTATCGAACGTCCGGTCGCCCAACAACATGGCAGTGCTGGCGGCACTCATCCACGCGCAACCCCACCTCGTGCCGGCCGCCATGGCCGACACGTACCAGGAGCTGGTGGTCCGCACCGAGGACTTCCTACGGCCGCTCCGTGCGCTCACCCGCGAGTGCGTCCGCGCCACGCGATCAGATGCGGCCGCGCTGCTGCCGCTAGCGAGGGCGCTAGCGCACCCGCCTCCGCAAGATCCGCCGCCCGAGATCCGCGAGAGAGCCTTTCAGGCCCTCGCCGACCTGTTCTGCGTCTGTTGCCTGGTGACGGCGGCACACAGTAAGCACTCGGCCGACTACCGCTCGCAGCTATGCGCGCTACAACAGCAGGCTTTAGGCTGGCTGCTGGACACCGCCGTGCCCGTGTACCGCCCGCCGCACCACGACTTCCTGCTGGCTCTCAATAAGATAATGTTCGTGGAGAGTGCGGAGACATACAGCAAGGTGGACAACTGGCCACCGGAGAGCGAGCGTGCGGTCACCTACCGCCTGTGCTGCGAGGCGCCACTGCCGCAAAACACGCTGCTGAGACTAGTCTTCATAGGACTCTCCAAGGCAAGTGAGATCCCCGTGTCTCCTACGGAAGTGTTCGAGCTGGTGGAGCAGGTGGTGCGGCGAGCGTGTGCTCTCCCCCCGGAAGACAAGCCGTTACAAGTTGACAAGTTGGAGGTGGCCGACTACATCTTCCAACTGTGCCAGTTTCATCCACCAGATAACATCACCCTCCCCACTGGGTACAGTCCTCCAGCGCTGGCCATCACGTCCCTGTACTGGCGCGGCTGGATGCTTCTGACGATGTTGGCAGCTCACAACCCTCAGGGATTCGCAGAGCGAGCAGCCGCCGCCTACCCCACGCTGAGGGCGCTCATAGAGTGCTGCATCACCAGCAAGCCGTCTATCGAGTGGGCCAGTCAAGCGGCCGAGTCCGAGCGCGCGGAGGCAGAGCGCGCGGCCGTCCTTCAGCTGGAGACACACCTGGCAGCTGCTAGCAACGCCAAGCTGCCCGTCACCGAACATTCCTCTAGACTTCTTGCACAGCTGACGACCTTGGAACCTCTCGGCCCAGCTCGTCGTCCTCCCGCCGGCGTGGTGGAGGCTTTACAGGCGTTGAGCACACAGCTTCGTCTGGGACGCCTGCTGTGTCGCCAGCCCGCGTTACTTTTGCAGCTGGTCGAAAGACACGGCACCAGGCGGGCCATGCCTTGGCTGCACCAGCTGCTGAGACACGACCGCCTGGAGCTCAGCGTACTTCCGGTGCAGTGCCTGTGCGAGTTCCTGTCGGCGGGCGGTGGAGGCGGCGAGACCGGGAAGGCCGGCGAGCTCTGCGCGCACCTCCGCCGCACCCTGCACAGCGAGGAGGGGGCGCGGGCCGTGCTGCACTACTACCTGCAACGCCTGGCGCTCGCACACGCGCCCACCAGGGCGGCCGCCAGCAGGGGATTGAAATTGGTGCTGTCGCAGACGGACGACGCGGCGGACATGGACTACAACGCGGAAGTCAGTCCCGAGTCGTGGCTGGAGCTGCTCCCTTCTCTGCGTCACTGGGAGGCGCTCCGCGGCGAGGCGCTCCGCCGTATCCGCGCGGCGTGCCTGGCGGAATGCTCCCCCCGCAGCCTGGCCGCCTACCTGTCCTTCCTGGCCGACCACCACCTCCACCACCAGCCGCTGGGCGACCTCGTGCTGGACTTGTCCCAGGTGTTGATGGAGCGCACCACGGTGATGGGGTACGTGTTGCCGGCCGTGGACAGCAAGGAGCCGCCCCGGGAGCCGCGCGCCCTGGCCCAGCACCGCGCCCTGCACGCTCTCGCCTCCGTGTTCTACACACACCTGCGACAGACGTTGAGTGATTATATCTTTAATGTAACGGCTGCCCCGTCGTTCCAGGTGCTGTCGTCCCCGGAGCCCGAGCCCGAGGAGGTGTCGTCGGAGGCGGCGGGCTGGAGCGGCGAGCGCGTCACGCTGCAGTGGGCCAACGGTCGCCGAGCCACCATACACGTGGTGGTGGCGCACGCGCACCTCAAGCTGTTGTGTTACGGACCTTCCTGCTACGACACGAATCAAGAGATGTACTCGTGGCTGCAGTCGACGTGGGTGGGCACGGACGCGCCCGAGGCCTTCACGTCCGAGTCCCCGGACGAGGCCGTGCTGCTGCCGGACTGGCTGCGGCTCAGCCTGGTGCGGAGCGCCCGGCCCGCCCTGCTCGAGGCCGGCCTGCGAGGTCTGCCCGCACACAAGCTGGCGCTCTTCATACAGACCTTCGGCATGCCCGTCTCCTCTATGAGTGCTCTCCTGAACGCCCTGGACGCGTGTTCTGCGGGCGCCGTGGTCCGCCTGGGTGTGGAGCGAGCGTACATGTCTCAGCTGCTGAGAGTGCAGCGAGCGCGAGGGGCGCAGGGGGGACACGCCTTCGCAGCCGCCCTCCGCCTCAGCAGACCCGTCTACCCGCCCGATGACACTTTGTTCGCTGAGGAGACTCTTCCTGAAGAGGAACACGACCCTTGGAGCATGCCTCGCGAGCCCACCAGGCTGGACGCGGGAATGGTGGGCGCTCTCATCAACACCGCCTTCAACGGCGCCGGCACCTTTAAAGGGGATCTGGATACAGCATTCACACAGCTTAATGGGTTGATATCGGAAGAATGTGCGAGTGGCGGTGCGTCCCCGGTGACCGGGGCGGCGGTCGCGGGGCTCCGCGGCGTGTCCGCGGCCGCCCTGCAGCTGCGAGCCGCCTACGCCGCGCCCTTGATTCGCTCTCTGGCCTTCGCTAAACCGCCCGGGTTCGCCGAGTTGGCCAGTTCCCTCCTGTCGCAGTGTAAGTTGTCCCGGGGCCCTGTCCCAGACGCGCTCCGTGAGTCCGCTGGTCGCGCCGGCGCCGCGTCCCGCCCCGCCTACGCCGCGCTCGCACGCCGCGCCACCAAAGAACAGCTGGTCGAAGTATTTGAATCGGCCACCGCCAGCACGATGGAACAGATCGGAAACGAGATCATAGAGACCCAGGACACGCAGCTGGTGGTGGACACCATCACCACCCTGCTGCAGCGAAACCAAGAGGGCCGCTATGAATTGCCCGTCCACTTCTACCTTCAGCAGGAAGAGAAGTACGGTCAGTCGAAGTCGCTCGTGGGCGACGCTCTGCTGCTGTCGCGGCGGGGGCTGGGGTGCGGCCTGCTGCTGGATTGGTTGGCGGAACTGCAGCGAGAGACGCTCGACTCACAGATGCGGTTAATGTTCGTACGCGGCGCCGGGGCTGGTGGCGGCGGCGCCTGGCGTCCCCTGCTGGTGACGCTCGTGGCTCACCGTGCGTCCTGGAGCACCCTGCACGCCTGCCTCGCCGCCTTGCTGTCCACCAGGAATTGGTCGGCCCGCTCGGTGCTGGACTTCGCGGAGACGTTGATCGGCAGCCCGAGGGTCTGGCAGGGGAGAGACCGCAGCACGCCCAAACACCACTCACCCGACGATTCACTGAGGCTCACCAACAAACAGCTGGAAGTGTTGATCCACTACATGGGCGAGGAGGCGCGCGAGGCGGAGGCGGCGGAGGGCGGGGAGGCGGCGCGGCGCCGGGTGGAGGCTCGTCTGCCGCTGCTGCTGAGGTGCTGCTCCACCCCGCAGGCGCTGCTCGCCGCCGCCCTGGCCGCCTCCAACACACATCCCTTGCTGCTGTTGCTGTTGTACATGAAGGTCCCCAAGGTGTTACACCTGCTTCGCACCTGCGGAGACCGTCCCGCCGTGTTGGAGGTGTCTCCGGCGGCGGTGAGGGCGGCGGCCTGGCGCTCCACCAGCGCCACCGACAAGGTGTCCCACTGTCTGCTCACCGCCCTGGCCGCGCCGCATCACCACTCCAAGGAGAACTCCCAGAAGCTGGTGCGTGTGGAGAGCGAGGTGCGGGCGGTGTGGTCGCGCGTGCAGGGCGCCGGCCAGCGCGGCCTGTCCCTGGCGGGGGCCCTGCTGAGAGGGGCGGCGCCGCACGGACAGAGACACGCTCACAGCCACCTGCTGGCCGCCATCGAGATGCTGCCGGATGAGGAGCTGTTCGCGACACACGTCTGTGACGAGGTGCACGGTATCCTGGAGTGTTTCCTGGGCACGGTGAAGCAGGGCAGTGGCGGAGGCAGCCTGGCTCACCGCGTGGCCGCCCTGCTGAGACGATACCGCGCCGCCCGCCCCGCCCGTGCCGCCGCCCTGCTCCACACACACCGGGACATCATCGCCTCCAACCCCGCCCTGAGCGGCGCGTGCAGCGCCGAGGGTGCCGCGTCCTCGTCTCCGCCGCCGCGGGCGCTCCTCGCCTTGCAGCGCCGCACCGCCTCGCCTGACGAGCTCCATTGGCTACTTCAGGAGGTGGAGGCGTGGGGCGTGCGTCGCGGCGGATCGTGGGGAGGTTCAGGCGGGGCGGACGCTCTGTTGCGTGCGGCGGCTCCTCTGGCAGCCTCACCCCACGCTCCTCTACGCAACGCCGCACTGTCTCTGCTAGCCAAGCTCTTGCCCGCCGTTCCCGACACACACCCCGGTCTCCAAGCTGTTCTGGAGTGCCTGGACTCAAGCCAGCCGGAAATAGCTCAGTCCGTGCTGGACAAACTCCCGGAGCTAGTGGTGGGTATGCAGGAACACGCGTCAAGACTCCTGATGCGCGTGTTCGAGATGGGCATGAAATCTCGGCTGCCGGTGGAGCAGTGTATCGCCAAGTGCGTGGCCACCATCAACACCAACCGGGGCTGCTGA

Protein sequence:

>DPOGS212820-PA
MERGKMSSTGRGGKSKAPQHPQDIFALGSKSTVSVSRDSEKRNIHKPSTSSVGERKRGESSGFGGQPASKRARLGSPAEAVTGTSLEVDPVDLVPNVLQALDTHNSDKLLGLLTGSIRLLKSQRSKPDPILCMSMLYLTKIRPNMFAHETVTQSLCTLLKREQGAAFKSKGNPLVFVLACNMLYAGHKDSNNWPDTFIKVYIEDALNERWWVDCSWCKCLVENILTAFGTKQPPAHLIPTDNSLGTMSPSGSGSPLMGTNEEDTDNTDLEYSVFPRYSSSYESVEALVLEAIKEQIQRRAAPDAIGKGFLKLLSATCGFPEIRMIAASRLEAWLHSGKLWRAAQELLAHVCANAAASGPSAARDHEVLAQLARMRLKTKPLQAAYQACLRDMVADSPALLRSVVTHTIYNELSNVRSPNNMAVLAALIHAQPHLVPAAMADTYQELVVRTEDFLRPLRALTRECVRATRSDAAALLPLARALAHPPPQDPPPEIRERAFQALADLFCVCCLVTAAHSKHSADYRSQLCALQQQALGWLLDTAVPVYRPPHHDFLLALNKIMFVESAETYSKVDNWPPESERAVTYRLCCEAPLPQNTLLRLVFIGLSKASEIPVSPTEVFELVEQVVRRACALPPEDKPLQVDKLEVADYIFQLCQFHPPDNITLPTGYSPPALAITSLYWRGWMLLTMLAAHNPQGFAERAAAAYPTLRALIECCITSKPSIEWASQAAESERAEAERAAVLQLETHLAAASNAKLPVTEHSSRLLAQLTTLEPLGPARRPPAGVVEALQALSTQLRLGRLLCRQPALLLQLVERHGTRRAMPWLHQLLRHDRLELSVLPVQCLCEFLSAGGGGGETGKAGELCAHLRRTLHSEEGARAVLHYYLQRLALAHAPTRAAASRGLKLVLSQTDDAADMDYNAEVSPESWLELLPSLRHWEALRGEALRRIRAACLAECSPRSLAAYLSFLADHHLHHQPLGDLVLDLSQVLMERTTVMGYVLPAVDSKEPPREPRALAQHRALHALASVFYTHLRQTLSDYIFNVTAAPSFQVLSSPEPEPEEVSSEAAGWSGERVTLQWANGRRATIHVVVAHAHLKLLCYGPSCYDTNQEMYSWLQSTWVGTDAPEAFTSESPDEAVLLPDWLRLSLVRSARPALLEAGLRGLPAHKLALFIQTFGMPVSSMSALLNALDACSAGAVVRLGVERAYMSQLLRVQRARGAQGGHAFAAALRLSRPVYPPDDTLFAEETLPEEEHDPWSMPREPTRLDAGMVGALINTAFNGAGTFKGDLDTAFTQLNGLISEECASGGASPVTGAAVAGLRGVSAAALQLRAAYAAPLIRSLAFAKPPGFAELASSLLSQCKLSRGPVPDALRESAGRAGAASRPAYAALARRATKEQLVEVFESATASTMEQIGNEIIETQDTQLVVDTITTLLQRNQEGRYELPVHFYLQQEEKYGQSKSLVGDALLLSRRGLGCGLLLDWLAELQRETLDSQMRLMFVRGAGAGGGGAWRPLLVTLVAHRASWSTLHACLAALLSTRNWSARSVLDFAETLIGSPRVWQGRDRSTPKHHSPDDSLRLTNKQLEVLIHYMGEEAREAEAAEGGEAARRRVEARLPLLLRCCSTPQALLAAALAASNTHPLLLLLLYMKVPKVLHLLRTCGDRPAVLEVSPAAVRAAAWRSTSATDKVSHCLLTALAAPHHHSKENSQKLVRVESEVRAVWSRVQGAGQRGLSLAGALLRGAAPHGQRHAHSHLLAAIEMLPDEELFATHVCDEVHGILECFLGTVKQGSGGGSLAHRVAALLRRYRAARPARAAALLHTHRDIIASNPALSGACSAEGAASSSPPPRALLALQRRTASPDELHWLLQEVEAWGVRRGGSWGGSGGADALLRAAAPLAASPHAPLRNAALSLLAKLLPAVPDTHPGLQAVLECLDSSQPEIAQSVLDKLPELVVGMQEHASRLLMRVFEMGMKSRLPVEQCIAKCVATINTNRGC-