Monarch geneset OGS2.0

DPOGS210814
Transcript: DPOGS210814-TA; 3381 bp
Protein: DPOGS210814-PA; 1126 aa
Genomic position: DPSCF300027 - 637130-650596
RNAseq coverage: 304x (Rank: top 37%)
Annotation
Heliconius: HMEL003500; 0.0; 76.81%
Bombyx: BGIBMGA007130-TA; 0.0; 87.33%
Drosophila: Myo10A-PD; 0.0; 64.13%
EBI UniRef50: UniRef50_D6WX09; 0.0; 68.33%; Putative uncharacterized protein n=4 Tax=Pancrustacea RepID=D6WX09_TRICA
NCBI RefSeq: XP_969646.2; 0.0; 68.33%; PREDICTED: similar to AGAP005213-PA [Tribolium castaneum]
NCBI nr blastp: gi|270012327; 0.0; 68.33%; hypothetical protein TcasGA2_TC006465 [Tribolium castaneum]
NCBI nr blastx: gi|270012327; 0.0; 68.33%; hypothetical protein TcasGA2_TC006465 [Tribolium castaneum]
Group
Gene Ontology: GO:0005856; 2.8e-27; cytoskeleton
Gene Ontology: GO:0005515; 6.4e-09; protein binding
KEGG pathway 
InterPro domain: [118-226] IPR000857; 2.8e-27; MyTH4 domain
InterPro domain: [441-535] IPR011993; 6.4e-09; Pleckstrin homology-type
Orthology group: MCL10077; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210814-TA
ATGTCGGCGAGTGCGGAGTTCGCCGACATGTCGGACGACAGTCTTTCGGACAGCGACTCGATTGACCGACACGCCGACGTATGTCACTGTGAGGAAGTTAAAGGCAATTTACTATTGGAAATCTTAAAAGCGGACTCGTCACTCAAGAGTAAGGCGAAGCGCAACGAGTGGACGTGGAAGGCTCAGACAGATGTGGTCAAGTGGCAAGCGACGCCGCTGCGGGCTCCGCTCCTGCGCCTCCCGGCGGCGCTCGCCCCGCCCGCCCTGGAGTGCTTCACCTGCATCCGAGCCTACTGCGGGGACCTGCAGCCTCCCGAGCGAGCCATGCATCAGGATTTGACGGAAGTGAAATGTGTCTATACTGTTCTCATGCACTGCCACTCGGTGGCGGAGCTCCGTGACGAGGTGTACTGCCAGCTGATGAAGCAGACCACCTCCAACCGCTCGCACGCGCCCGACTCGTGCCAGCGAGCCTGGAGGCTCATGTCTATACTGTCCTCATACTTCACCTGCTCTGAGACGCTCAGACCGTTCCTGGTGGAGTATCTGTCGGCGGCGGCGGCGGACAGGCGGAGGCCGTGCCAGGGGACCGCGGCCGTGTGTCTCGCCAACTTCAGGAAGACCATGAGGTGCGGAGGCAGGAAGAACGTACCCAGCGTCGAGGAGGTGACGGCTGTGTCAGCTGGTCGGTCGGCTCGCCGGCAGCTGTACCGCCTGCCGGGAGGAGCGGAGAGGGTCGTCAACACCAGGTGCGCGACAGTCGTCCAGGACATAGTGGACGACCTGTGCGAACTGATCGGCGTGTGCAACCCGGCGGAGCGCGCGGAGTTCTCCCTGTACTGCATCGTGGCGGGCGACTCCCTCACCATGCCGCTGGCGGGCGACGAGTACGTGCTGGACGTCACCACGGAGCTGCAGCGCGCTCAGCATCCCTTCTACCTCATCTTCTGCCGCTCCGTGTGGCACCACCCGCTCAGGACCGACGCTCCGCCGTTGTACACTGAAGTGCTCTTCAATCAGGTTGCCCCAGACTATCTAGAAGGACTCCTGCTAGTGTTGCCAGGGGGAGGCGCTCCCGCGGCGGGGGTGCTTCGTGACGCAGCAGTGGTGGCAGCCCTGCTACACCGCGCGGCAGGCCTGCCAGACGCACCACACCCCAGGGACCTCAAGTTCCTTCTGCCGAAGCCTCTCCTAGCTCTGAAAGAGCCGCGTCCCAACAAGTGGGCGTCGTGGGTCGGCAACGAGTGGCCGACTGTGCGGACTCTGTCGCCTGCCGCAGCTAAATCCAAAGTTTTACAAGTATTATCTCGCTGGTCCCTGTTCGGGTCGTCTTTCTTCGCGGTGCGGCGCGTGCAGGGCGGCGAATGGCGCGAGCACGTGCTGGCTCTCAACAGGCGAGGCCTGCACCTGCTGCACCCCGCCACGCACGACACGGACGCTCACTGGCCCTACGCCGACCTCATCTCCACTAGGAAGGTCCGTTCCGAAGACGGGACTCTGTTCCTGGACGTGAAGTGCGGTTCCCTTCTGCAGCAGCGAGTGACGCGGCTCCAGGCGGAACAAGCTCACGAGATAGCGAGACTCATCAGGCAGTACATCGCCCTGCAGAGAGATAACAGGGAGGGAGATTCGAGGCACTCACCAGGCAAGGTTGCTGGTAACGTGACCCCGTCCGCCGCCGCTTGGAATCATATTTGTAAATCACAGAGTCATATCGGCGTGTGTAACCCGGCGGAGCGCGCGGAGTTCTCCCTGTACTGCATCGTGGCGGGCGACTCCCTCACCATGCCGCTGGCGGGCGACGAGTACGTGCTGGACGTCACCACGGAGCTGCAGCGCGCTCAGCATCCCTTCTACCTCATCTTCTGCCGCTCCGTGTGGCACCACCCGCTCAGGACCGACGCTCCGCCGCTGTACACTGAAGTGCTCTTCAATCAGAAAGGCATCACTGATTGGATGTCATTGGAAATCTTAAAAGCGGACTCGTCACTCAAGAGTAA
GGCGAAGCGCAACGAGTGGACGTGGAAGGCTCAGACGGATGTGGTCAAGTGGCAAGCGACGCCGCTGCGGGCTCCGCTCCTGCGCCTCCCGGCGGCGCTCGCCCCGCCCGCCCTGGAGTGCTTCACCTGCATCCGAGCCTACTGCGGGGACCTGCAGCCTCCCGAGCGAGCCATGCATCAGGATTTGACGGAAGTGAAATGTGTCTATACTGTTCTCATGCACTGCCACTCGGTGGCGGAGCTCCGTGACGAGGTGTACTGCCAGCTGATGAAGCAGACCACCTCCAACCGCTCGCACGCGCCCGACTCGTGCCAGCGAGCCTGGAGGCTCATGTCTATACTGTCCTCATACTTCACCTGCTCTGAGACGCTCAGACCGTTCCTGGTGGAGTATCTGTCGGCGGCGGCGGCGGACAGGCGGAGGCCGTGCCAGGGGACCGCGGCCGTGTGTCTCGCCAACTTCAGGAAGACCATGAGGTGCGGAGGCAGGAAGAACGTACCCAGCGTCGAGGAGATCGGCGTGTGTAACCCGGCGGAGCGCGCGGAGTTCTCCCTGTACTGCATCGTGGCGGGCGACTCCCTCACCATGCCGCTGGCGGGCGACGAGTACGTGCTGGACGTCACCACGGAGCTGCAGCGCGCTCAGCATCCCTTCTACCTCATCTTCTGCCGCTCCGTGTGGCACCACCCGCTCAGGACCGACGCTCCGCCGTTGTACACTGAAGTGCTCTTCAATCAGGTTGCCCCAGACTATCTAGAAGGACTCCTGCTAGTGTTGCCAGGGGGAGGCGCTCCCGCGGCGGGGGTGCTTCGTGACGCAGCAGTGGTGGCAGCCCTGCTACACCGCGCGGCAGGCCTGCCAGACGCACCACACCCCAGGGACCTCAAGTTCCTTCTGCCGAAGCCTCTCCTAGCTCTGAAAGAGCCGCGTCCCAACAAGTGGGCGTCGTGGGTCGGCAACGAGTGGCCGACTGTGCGGACTCTGTCGCCTGCCGCAGCTAAATCCAAAGTTTTACAAGTATTATCTCGCTGGTCCCTGTTCGGGTCGTCTTTCTTCGCGGTGCGGCGCGTGCAGGGCGGCGAATGGCGCGAGCACGTGCTGGCTCTCAACAGGCGAGGCCTGCACCTGCTGCACCCCGCCACGCACGACACGGACGCTCACTGGCCCTACGCCGACCTCATCTCCACTAGGAAGGTCCGTTCCGAAGACGGGACTCTGTTCCTGGACGTGAAGTGCGGTTCCCTTCTGCAGCAGCGAGTGACGCGGCTCCAGGCGGAACAAGCTCACGAGATAGCGAGACTCATCAGGCAGTACATCGCCCTGCAGAGAGATAACAGGGAGGGAGATTCGAGGCACTCACCAGCCTTCATCAGCCAATGA
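
The nucleotide record above is wrapped over more than one line, so anything reading it has to join the continuation lines under the `>DPOGS210814-TA` header. A minimal sketch of such a reader follows, assuming plain FASTA input. The function name `read_fasta` and the toy record are hypothetical and are not part of this record or of any MonarchBase tooling.

```python
from typing import Dict, Iterable, List, Optional


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Return {record_id: sequence}, joining wrapped sequence lines.

    Hypothetical helper: the record id is taken as the first
    whitespace-delimited token after '>'.
    """
    records: Dict[str, str] = {}
    current_id: Optional[str] = None
    chunks: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if current_id is not None:
                records[current_id] = "".join(chunks)
            parts = line[1:].split()
            current_id = parts[0] if parts else ""
            chunks = []
        elif current_id is not None:
            chunks.append(line)  # continuation of the current sequence
    if current_id is not None:
        records[current_id] = "".join(chunks)
    return records


# Toy input (not the real sequence): one record wrapped over two lines.
example = [">toy-TA", "ATGTCG", "GCGTGA"]
toy = read_fasta(example)
```

Applied to this page, the same call would return one entry keyed by `DPOGS210814-TA` whose value is the two sequence lines concatenated.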

Protein sequence:

>DPOGS210814-PA
MSASAEFADMSDDSLSDSDSIDRHADVCHCEEVKGNLLLEILKADSSLKSKAKRNEWTWKAQTDVVKWQATPLRAPLLRLPAALAPPALECFTCIRAYCGDLQPPERAMHQDLTEVKCVYTVLMHCHSVAELRDEVYCQLMKQTTSNRSHAPDSCQRAWRLMSILSSYFTCSETLRPFLVEYLSAAAADRRRPCQGTAAVCLANFRKTMRCGGRKNVPSVEEVTAVSAGRSARRQLYRLPGGAERVVNTRCATVVQDIVDDLCELIGVCNPAERAEFSLYCIVAGDSLTMPLAGDEYVLDVTTELQRAQHPFYLIFCRSVWHHPLRTDAPPLYTEVLFNQVAPDYLEGLLLVLPGGGAPAAGVLRDAAVVAALLHRAAGLPDAPHPRDLKFLLPKPLLALKEPRPNKWASWVGNEWPTVRTLSPAAAKSKVLQVLSRWSLFGSSFFAVRRVQGGEWREHVLALNRRGLHLLHPATHDTDAHWPYADLISTRKVRSEDGTLFLDVKCGSLLQQRVTRLQAEQAHEIARLIRQYIALQRDNREGDSRHSPGKVAGNVTPSAAAWNHICKSQSHIGVCNPAERAEFSLYCIVAGDSLTMPLAGDEYVLDVTTELQRAQHPFYLIFCRSVWHHPLRTDAPPLYTEVLFNQKGITDWMSLEILKADSSLKSKAKRNEWTWKAQTDVVKWQATPLRAPLLRLPAALAPPALECFTCIRAYCGDLQPPERAMHQDLTEVKCVYTVLMHCHSVAELRDEVYCQLMKQTTSNRSHAPDSCQRAWRLMSILSSYFTCSETLRPFLVEYLSAAAADRRRPCQGTAAVCLANFRKTMRCGGRKNVPSVEEIGVCNPAERAEFSLYCIVAGDSLTMPLAGDEYVLDVTTELQRAQHPFYLIFCRSVWHHPLRTDAPPLYTEVLFNQVAPDYLEGLLLVLPGGGAPAAGVLRDAAVVAALLHRAAGLPDAPHPRDLKFLLPKPLLALKEPRPNKWASWVGNEWPTVRTLSPAAAKSKVLQVLSRWSLFGSSFFAVRRVQGGEWREHVLALNRRGLHLLHPATHDTDAHWPYADLISTRKVRSEDGTLFLDVKCGSLLQQRVTRLQAEQAHEIARLIRQYIALQRDNREGDSRHSPAFISQ-
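
The lengths listed for the transcript (3381 bp) and the protein (1126 aa) can be cross-checked with simple arithmetic. This sketch assumes the transcript record is coding sequence only and ends in a stop codon, which fits the trailing `TGA` in the nucleotide record and the `-` at the end of the protein record. The function name `expected_cds_length` is hypothetical.

```python
def expected_cds_length(protein_aa: int, has_stop_codon: bool = True) -> int:
    """Length in bp of a coding sequence for a protein of protein_aa residues.

    Assumption: three bases per residue, plus one extra codon when the
    sequence includes its terminal stop codon.
    """
    codons = protein_aa + (1 if has_stop_codon else 0)
    return 3 * codons


# Values copied from the record header above.
TRANSCRIPT_BP = 3381
PROTEIN_AA = 1126

consistent = expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP
print(consistent)
```

Here 3 x (1126 + 1) = 3381, so the two listed lengths agree under that assumption. This checks only the lengths listed in the record header, not the sequences themselves.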