Monarch geneset OGS2.0

DPOGS215593
TranscriptDPOGS215593-TA1836 bp
ProteinDPOGS215593-PA611 aa
Genomic positionDPSCF300097 + 217656-219491
RNAseq coverage339x (Rank: top 34%)
Annotation
HeliconiusHMEL0169270.086.05% 
BombyxBGIBMGA000351-TA0.080.05% 
Drosophilayrt-PA5e-16878.30% 
EBI UniRef50UniRef50_Q7QB865e-17375.38%AGAP004136-PA n=2 Tax=Bilateria RepID=Q7QB86_ANOGA
NCBI RefSeqXP_001847268.18e-17383.08%band 4.1-like protein 5 [Culex quinquefasciatus]
NCBI nr blastpgi|3479713082e-17275.38%AGAP004136-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|2700085019e-16961.15%hypothetical protein TcasGA2_TC015016 [Tribolium castaneum]
Group
Gene OntologyGO:00054883.2e-36binding
GO:00055153.6e-16protein binding
GO:00198982.2e-05extrinsic to membrane
GO:00080922.2e-05cytoskeletal protein binding
GO:00057372.2e-05cytoplasm
KEGG pathway 
InterPro domain[2-188] IPR0197491.7e-63Band 4.1 domain
[75-182] IPR0143523.2e-36FERM/acyl-CoA-binding protein, 3-helical bundle
[78-183] IPR0197488.4e-36FERM central domain
[2-76] IPR0189793.4e-20FERM, N-terminal
[29-41] IPR0197501.7e-19Band 4.1 family
[183-263] IPR0119933.6e-16Pleckstrin homology-type
[274-317] IPR0148473e-12FERM adjacent (FA)
[193-267] IPR0189808.6e-09FERM, C-terminal PH-like domain
Orthology groupMCL12320 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215593-TA
ATGTTGCTTGATGATACTGACTTATCTATTAATCTATCTAAAAAGGCTAGTGCAGGAGATCTTTATGAACAAGTCTTTTATTCTCTTGATTTGATTGAGAAAGATTATTTTGGCTTACAATTCACTGATACAAATAATGTTAAGCATTGGTTGGACCCAACAAAAACCATTAAAAAGCAAGTAAAAATAGGACCACCTTACACTTTAAGACTGAAGGTTAAATTTTATTCATCTGAGCCTAACAATTTAAGAGAAGAGCTCACAAGATATCAGTTTTTCCTACAATTGAAAAAAGATATCTTAGAAAGTAAGCTGGAATGCCCACATTCTACTGCAGTTGAATTGGCAGCTTTGGCTTTGCAATCTGAATTGGGTGATTTTGATGAAACTATACACACCCCAGCTACAGTATCTGAATTCCGTTTTGTCCCGAATCAAACAGAAGAAATGGAAATCGAAATATTAGAAGAATTTAAAAAATGCAAAAATCTCACACCAGCCCAAGCTGAAGTCAACTATCTTAATAAAGCAAAGTGGCTTGAAATGTATGGTGTTGATATGCACATTGTGTTGGGAAAAGATGGGTGTGAATATCATCTTGGTTTAACACCAACGGGAATTCTTGTTTTTGAAGGACCCCAGAAAATCGGTCTCTTCTTTTGGCCAGTCGAAGACGACGACCAAGGACGCGAACAGGAACATACATTTGTATTTCGACTACACAATGAGAAGGCCTGTAAACATCTTTGGAAATGTGCTGTAGAACATCACACTTTCTTCCGCTTACGAGCTCCAGTGAAAGGTCCATCGGCAAGACAAAACTTCTTCAGAATGGGCTCAAGATTTCAATATTCAGGTAAAACTGAATACCAAACCACACAACAAAATCGTGCGCGTAGAACTGTTCAATTTGAGCGTAGGCCTAGTCAGCGGTTTGCTCGTAGACAGAGTCATGTGTTAAGGGAACGAGAAAAGCAAAATACATCTACAAAGACAGAAACGGTTGCACAACCAGAGCCGAGTACTTCCGCAGAAACTACGGCGGTGGTGAATGAAAATGTTAATGCTCTTGATACATTATCTCGTAAGAGCAGTGCCAAATCACAAAAGTCTTCGCTGATTGATAGCGATTATAAAATGACTGAAGTAGGAGAACCATCGACCAAGAATCTTGTAGATGACGACACGCAAGACGAAGTTCACATATCAAAGGAAGACGCTATTAACAATAAGGTTCTGTTTAAAATGGACAAGTGTGAAGAAAAGAAAAACAATTTAACACAACTGGTTATTAATAAGCCGCCTTGCAATTGTTCTCCAGTTGCAGAGTTTAATCCATTAAATGATCTCTTAATGAGCTTGGTGAAGGACAAATTGAACACGGACGACGCTAAAAACGTAAACACAAAAGACCAGGATTCAGTAGACAACGAGGTACCTAACAATCAAAATAAATTTATGCTTGGCTCAAAGAAAAATCTCCCACCTGGTCAGCTAAAATGCAATAACATACTCAAAGCTCGGGAGAATGAAGTTAAGATAATAAACGAATCGTCAAACATGAACCTATCAATTCCGTCAACGCCGTCACCTAAGATAATAAATGAAAATAATACCCAATACGTATCTATTGTTGTTGTTGAACCTCCGCATTCACAAGTGAAGGTAGCGTCACCTAAACCGGTGGAGAGACAAGAGGAAATCGCTCCTAAATCCCAACCTATATCATCTCTATCACCGTGGCTGGTGACGTCAGAGCCCAGTTCACCGTCCGGTATCGGAGAGAAGGAGATAACTATCATTCACAGAAAATCAGTTATAACAACCCAACTTTAA

Protein sequence:

>DPOGS215593-PA
MLLDDTDLSINLSKKASAGDLYEQVFYSLDLIEKDYFGLQFTDTNNVKHWLDPTKTIKKQVKIGPPYTLRLKVKFYSSEPNNLREELTRYQFFLQLKKDILESKLECPHSTAVELAALALQSELGDFDETIHTPATVSEFRFVPNQTEEMEIEILEEFKKCKNLTPAQAEVNYLNKAKWLEMYGVDMHIVLGKDGCEYHLGLTPTGILVFEGPQKIGLFFWPVEDDDQGREQEHTFVFRLHNEKACKHLWKCAVEHHTFFRLRAPVKGPSARQNFFRMGSRFQYSGKTEYQTTQQNRARRTVQFERRPSQRFARRQSHVLREREKQNTSTKTETVAQPEPSTSAETTAVVNENVNALDTLSRKSSAKSQKSSLIDSDYKMTEVGEPSTKNLVDDDTQDEVHISKEDAINNKVLFKMDKCEEKKNNLTQLVINKPPCNCSPVAEFNPLNDLLMSLVKDKLNTDDAKNVNTKDQDSVDNEVPNNQNKFMLGSKKNLPPGQLKCNNILKARENEVKIINESSNMNLSIPSTPSPKIINENNTQYVSIVVVEPPHSQVKVASPKPVERQEEIAPKSQPISSLSPWLVTSEPSSPSGIGEKEITIIHRKSVITTQL-