Monarch geneset OGS2.0

DPOGS214064
TranscriptDPOGS214064-TA3885 bp
ProteinDPOGS214064-PA1294 aa
Genomic positionDPSCF300171 + 27645-32389
RNAseq coverage455x (Rank: top 27%)
Annotation
HeliconiusHMEL0082730.078.14% 
BombyxBGIBMGA010558-TA0.069.57% 
Drosophilaex-PA4e-9442.81% 
EBI UniRef50UniRef50_B0W1I33e-12142.26%Expanded n=2 Tax=cellular organisms RepID=B0W1I3_CULQU
NCBI RefSeqXP_001842567.15e-12242.26%expanded [Culex quinquefasciatus]
NCBI nr blastpgi|1700293731e-12042.26%expanded [Culex quinquefasciatus]
NCBI nr blastxgi|1582998680.037.75%AGAP009126-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00054882.2e-18binding
GO:00055158.6e-16protein binding
KEGG pathway 
InterPro domain[56-270] IPR0197495.1e-21Band 4.1 domain
[156-262] IPR0143522.2e-18FERM/acyl-CoA-binding protein, 3-helical bundle
[159-261] IPR0197486e-17FERM central domain
[282-356] IPR0119938.6e-16Pleckstrin homology-type
[283-357] IPR0189809.1e-16FERM, C-terminal PH-like domain
Orthology groupMCL15571 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214064-TA
ATGTGGGGTTCAAACAGTGCAGGTAGCGAAAAGCCAATTGTGTACGAAGAAAAAAAAGAGAGAATGGAATGCGCAGATGGTGTAGGGAGAGTAATAGAGGTATCGAGATGCGATATGCGCGCTCTGTGTTCGGTTCGCGGGCTGGGCGGGGAGGCCCGAGCGCTGGGGGCAGGCGCACGCCTCCTATCCCTCCGGATGCCCGGCCAGCCTCAGCCATTGCACTTCGTCGTGGAAGCCAAGGCCAGGGTGAAGGAATTGAAAATGTTAGCCTACGCACACATACAACTTCAAGGCATGACCGATACAGAACTGTTCGGTCTCGCTATTATGCAAAATGGCGAATACCTCTTCGTAGATTTAGAAAGTAAACTTTCAAAATACGCACCGAAAAGCTGGAGGTCGTCCCATACTCATGGCTTAGATGCAAATGGGAAACCGCTTTTGGAACTTCATTTAGTTGTTCAGTTTCACGTGGAAAGTCCGCTACTTCTACACGATCAATGCGGTCGCCATTTATACTTCCTACAACTTTTGGAAAATATTCGTACCAGAGACGTGCTACCTGGTGAAATATTATTACTATTAATTGGATTAGCACTCCAAGCAAAATATGGAGATGAAGATTTATACGAAAATCAGGATTACTTCAAAATAGAGGACTTCGCACCACCCTCACTTACTGGTGAATGGGTTGTGTCGGCTATACGAGCCTGTCATCGCGAACACCATGGCCTCTCCAAATCCGACGCAGAAATTAGATTCATTCGAGAAGTTAGTCTTCTACCAGATACAATAAATTCACACAGATATCGACTGAAACAGTCAAAAACGGAATTAGAACCAGGAACGGTGTGGTTACTTGTAACAGCTAAGGGCATTAAAATATTACCAGATAATGGGCCACTGTCAAATTTTATATGGAGTTCTATAGGGAAACTTAGTTTTGATCGAAAGAAATTTGAAATTAGAACCGAAGAAGGAAAAATAACGTTATATTCTTCTAGTGAGGAGAAATGTAAATATTTGTTCGCATTATGTAAAGAAACGCATCAATTTTCTATGAAAATTTCATCAAAACTAAATGAAATATTAACGAAGGAAGAAGATGAAAGAAAAGTTTGTTTTGGATATTCAAAGGCATTAAACTTTCATTACTGTCATAATAAAAATGAACAAAGAATATCCCTCATATCGTCAACCAGCTCAAACACCACGTCAGGAATAGTCAGTGACAGAGTTCAGTCGGAAGATGAATTAGAAATAATGATTGATACACCACCGGCACCTTCGACGGAGAGTTTAGCATTCGCACATTTATTAGATTGTTCCAATTCTTATTTAATTCGACACATGCCTCATGACCAGCATTCTTTAAAAACGAGATCCTCATTACAACTTCCTATGTCAAAGCCCAGGATGAAGAAAAACACAGAAGACACTGAAAATATATGCAACAATGACATTTCATTGGACCGAGAAGGTGGTTCAACCAGTCAAGCAGAAGAAACAATTTCTTTGCCAGATCGTGTTATTGACAAGTCTGACAGCAGCCCAGGTTCTAGCAAAATAAAATGCACCGGCTCACAATGTTCATCGTCGTGCAGTACAGTTATTTTAGCCCGGGGAGGTCTCAGTACCTTGAGTAGAGCATCAAATGCCAGCAGCTTAGAATTGGGCTACAGTCACACGGCACAAAACTCTATGATAAGTGATAACAGTACGGGTGCAGTCGATGGAGAATATACACAAGATACAGCATCAGCTTTATATGATGGTCTGGGCCAACCAGTTACAGTAGCAGCATCAAGTGAAACTAGTGGAGTGTATACAATGGGTAGTTCTGAATTGACGACAGGATCAAAATTGTATGCCAATTCTGAAGGAAGCCGAACAGAGTATAGCGAATCAACTTATGACAGTTACAAAATCATAAAAGAAAACGACCTGGCTGATTTTGACAGTGTGTCATCAATACTTAAAAATAAATCAAAAAGAACTGATGCATCACCAAATCTTTTAAAGTCAAGAATATCTCTTACCAATGACTGTGTTGACGGTACATCACAATTTTGTAGTGAAACACGTGACCTCAAGCAAAACATGTTTAGAGAGCGTACTAACTCTAACGTAAGTGCCCTTTCATTTCATGGTGATGGGAGTGACCCAGCAGACAATAAACATAATTTGCTTAGTGCAAGTGAGTTAAGCGATCTGATTGTTGGTAGAGGTGTTTATCCTAAGAATCAATCTGTAAGCGACACTTTCGATTCGGTTTCGGATTACGTGAGGTTACCATTGCCATTTACCGGTGATAGTTACCTGAAAGGGCACGAAGACACTGCGCCATCTGATGATAACTATCCAAATTTTGATCGTCCACCAACTCCACCCACAAGAATTGATAGTCGTAAAGGACATAATGTATCTCTTCCTAATATGCTCCATCCAGATGAAAAAGTTGCGAGTTTACCAAAATTCTTTGAGAAACCGCCCCCACCATATGAATATAAACATTTAACTTTATCTTCTATGATACCCTCTAAACCACCACCTGCATATCCTGGCACAGTACCAACGGTCCCTTTGAAACAAATTAATGATAAAGAGGAAGTTTCAGCGAGAGTGGTCACATCAAAACCAATGATAACTATATTAAAGGCAGAAGCAGGAGACGTTAACATTACTGGTGAAAGGACATTTGCTAGCCCGATGGTGTTGGAACATCGTTTTCAAAAATCAAAACGGCACCAAGCGTCAAGTCGACGAGCTGAACGATCTAAATTGGCCCAAGGACTCAGCAATAACCTGTCACCATCGAGAGAACTACCACCTAGCATAGATTCTAATGTCTTGGTGGCAATGATGAAATTACCACCACCACCGCCACCACCTCGCCGTCCAAGACTTCCGCCACCACCACCTGTTAATCGCCTACCGCCGCCACCACCTCCCCCCCACAATCCTATCTTCCACCAACAACTATACAGCGACGTCGACTACGTGTACTATCCTCTTCAAGATCCTCTAATATCACAACAGAACTATTTAGATCATAAACTTACGGAATCGAGGTTGTCTAACACCCACAAAAACTGTTTACAATACAGAAGTACCCCCTTCCTTTCTAATTCTTTATCCATTTCATCTACATACGGATCGGTTCAAAATCTATCCGATTCATACATACAAATACCTGGAGCCAGAACAAGTTGGTATTCCATGACTAGTAGAAATTCAGCAAGTAGCCATTCTATAAATCTAGAAAGACCTCTAGTTCCCATGACATTGCCAGAATCTATAATGTCTCGAACGAAATCTCATGAAAATATCTTCTTCGTTAAGGATATGCCTCGTCAAAAGACAAGGAGAATGCCGCCTCCGCCTCCCCCACCTTATGAACACAAAAAGAAAATCCCATCACATTTAAAGGATTACAGAAGTCCCAATTGCAGCAAATTCAACAGTGCCGTTAGTGTCAAAGGAAAATCTGGGAATTGCGATCTCGATATTAAAACTCTCCGTGAGAAAAGTAAGAACCTGGACTTACCACTTATAGCTGCGTTGTGCAATGATCGTTCATTGCTAAAACAAACGAAAGCATTTGGTGTTCCAAAATTAAACAAACAGACAAGTAGTGACTGTGAAAGTGACAGAAAAAGTGCCAAGCCTTCTCAAAACACCACCGATAACATAGATCTTAAGAATAAATATGATTTTAACACTAGGGGGGCAGTGACTGGGTCACAAAAGAAAACTTTAGTAAGAAATCCTACAGACAAACTTCCAGCATTGCCCAATTCTGAAACTCATACTCCTAGAGCAATGTCTAACACATATGTAATGCATCCAAATGTCAAACAAAAGAAATGTCAGCCGAGTTTATAA

Protein sequence:

>DPOGS214064-PA
MWGSNSAGSEKPIVYEEKKERMECADGVGRVIEVSRCDMRALCSVRGLGGEARALGAGARLLSLRMPGQPQPLHFVVEAKARVKELKMLAYAHIQLQGMTDTELFGLAIMQNGEYLFVDLESKLSKYAPKSWRSSHTHGLDANGKPLLELHLVVQFHVESPLLLHDQCGRHLYFLQLLENIRTRDVLPGEILLLLIGLALQAKYGDEDLYENQDYFKIEDFAPPSLTGEWVVSAIRACHREHHGLSKSDAEIRFIREVSLLPDTINSHRYRLKQSKTELEPGTVWLLVTAKGIKILPDNGPLSNFIWSSIGKLSFDRKKFEIRTEEGKITLYSSSEEKCKYLFALCKETHQFSMKISSKLNEILTKEEDERKVCFGYSKALNFHYCHNKNEQRISLISSTSSNTTSGIVSDRVQSEDELEIMIDTPPAPSTESLAFAHLLDCSNSYLIRHMPHDQHSLKTRSSLQLPMSKPRMKKNTEDTENICNNDISLDREGGSTSQAEETISLPDRVIDKSDSSPGSSKIKCTGSQCSSSCSTVILARGGLSTLSRASNASSLELGYSHTAQNSMISDNSTGAVDGEYTQDTASALYDGLGQPVTVAASSETSGVYTMGSSELTTGSKLYANSEGSRTEYSESTYDSYKIIKENDLADFDSVSSILKNKSKRTDASPNLLKSRISLTNDCVDGTSQFCSETRDLKQNMFRERTNSNVSALSFHGDGSDPADNKHNLLSASELSDLIVGRGVYPKNQSVSDTFDSVSDYVRLPLPFTGDSYLKGHEDTAPSDDNYPNFDRPPTPPTRIDSRKGHNVSLPNMLHPDEKVASLPKFFEKPPPPYEYKHLTLSSMIPSKPPPAYPGTVPTVPLKQINDKEEVSARVVTSKPMITILKAEAGDVNITGERTFASPMVLEHRFQKSKRHQASSRRAERSKLAQGLSNNLSPSRELPPSIDSNVLVAMMKLPPPPPPPRRPRLPPPPPVNRLPPPPPPPHNPIFHQQLYSDVDYVYYPLQDPLISQQNYLDHKLTESRLSNTHKNCLQYRSTPFLSNSLSISSTYGSVQNLSDSYIQIPGARTSWYSMTSRNSASSHSINLERPLVPMTLPESIMSRTKSHENIFFVKDMPRQKTRRMPPPPPPPYEHKKKIPSHLKDYRSPNCSKFNSAVSVKGKSGNCDLDIKTLREKSKNLDLPLIAALCNDRSLLKQTKAFGVPKLNKQTSSDCESDRKSAKPSQNTTDNIDLKNKYDFNTRGAVTGSQKKTLVRNPTDKLPALPNSETHTPRAMSNTYVMHPNVKQKKCQPSL-