Monarch geneset OGS2.0

DPOGS200805
TranscriptDPOGS200805-TA1464 bp
ProteinDPOGS200805-PA487 aa
Genomic positionDPSCF300249 - 48388-51483
RNAseq coverage3x (Rank: top 90%)
Annotation
HeliconiusHMEL0158752e-1342.45% 
BombyxBGIBMGA011035-TA5e-4737.63% 
Drosophila% 
EBI UniRef50UniRef50_E9IE795e-4468.60%Putative uncharacterized protein (Fragment) n=1 Tax=Solenopsis invicta RepID=E9IE79_SOLIN
NCBI RefSeqXP_970252.22e-2328.23%PREDICTED: similar to formin 3 CG33556-PB [Tribolium castaneum]
NCBI nr blastpgi|3228000092e-4368.60%hypothetical protein SINV_03498 [Solenopsis invicta]
NCBI nr blastxgi|3228000092e-4068.60%hypothetical protein SINV_03498 [Solenopsis invicta]
Group
Gene OntologyGO:00082704.2e-05zinc ion binding
GO:00036764.2e-05nucleic acid binding
KEGG pathway 
Orthology groupMCL26480 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200805-TA
ATGACTGCGAACTATATTGCCAGTGTCCCTAAACTAAAAGGCAGGGAAAATTATGATGAGTGGAGCTTCGCGGCTGAAAATTTATTGGTTCTTGAAGGAATGGACAATTACATCAAGCCAACGGCAGGTTTTGAGGTTAAGCCAGCAGAAGATGCAAAAACGAAGGCAAAGCTTATATTGACTGTGGACCCATCGTTATACGTGCATATTAAGAATACCAAGAGCGCAGCAGAATTATGGACTACATTAAAGACCATGTTTGACGACTCTGGTTTCTCACGCAAAATAACATTGCTACGACATTTGATCTCTATACGCCTAGATAATTGTGATTCAATGGCTACATATGTGACTCAAATGGTCGAGACAGCTCAACGGTTGAACGGCACAGGTTTTACAATCACCGATGAATGGGTGGGTTCATTATTATTAGCAGGTTTGTCAGATCGTTATTCTCCCATGATAATGGCAATTGAGCATTCGGGCATTTCGATTACTGCAGATGTCATTAAGTCGAAATTGCTCGACATGGAAGTGACCAGAGACAATGACGCCGGTGCTGCATTTGCCGCGCGAAATAATCATTCGTTCAATAAGTCAAAGAAAGGCGGTCCCGGTCCTTCAACGTCAGTTTCAAACAAGAAAATAACAGCAGCAATGACAGATTCATCGAAAACAATTACATGCTACAAATGTAAAGAAGATGGTCATTATCGAAATCAGTGCCCTTTATTGAAGAAAAACAATATTAAATGTGTTTTTAATGTCGTTTTCCTGAATGGAAAGTTCAATAAAACTGAATGGTACGTGGATTCCGGCGCCAGCGCTCATATGACGGCGAATGAATCTTGGAAGCTCGTCAGCGGTCTACAGCTGGGTGACCAGAAACCCTCTCAATTACTTCGCAAGATGCGAGAACTAAGCGCCGGTATGATTACGAACGAAGGGCTCCGTATCGAGTGGCTTAACCATTTACCTACTCAGATACGCGTTGTTCTATCCGTAAATACCGAGTCATCACTCGACACACTCGCCGTCATGGCCGACAAGATGGCGGAATACTCCGAGCCCGCGATGATCGCCGCTGTATCGACCGCAACAACAACTACGAACGACGCCGTATCAACACAAATAGCAATATTATCGAAGCAGCTAGAAAAATTGTCGCTAGAAATCAACGAAATACGCGGCCGCTCCACACATCGTCAATACCGTCGCTATCGCTCTCAGTCAAGGCCACGATCCAATTCAAACTCAACAAGAAACAAATCATCTGTGAAGCCCGGCGATGCCACGTGGGAANAAGTAGGAGAACAAGGGCCACCGCTGCAGCCAAAGAAGCGGCAGCCGCTAGGAGGAAAGGAGCCCAACCACCGACGACGACAAAAGCCCCCGCCACTCAGGAGTGGACCAAAGTGGCGACCACGAAGAAGGTCAAGGGGCAAAGCAAAGATTAGAGGGTAG

Protein sequence:

>DPOGS200805-PA
MTANYIASVPKLKGRENYDEWSFAAENLLVLEGMDNYIKPTAGFEVKPAEDAKTKAKLILTVDPSLYVHIKNTKSAAELWTTLKTMFDDSGFSRKITLLRHLISIRLDNCDSMATYVTQMVETAQRLNGTGFTITDEWVGSLLLAGLSDRYSPMIMAIEHSGISITADVIKSKLLDMEVTRDNDAGAAFAARNNHSFNKSKKGGPGPSTSVSNKKITAAMTDSSKTITCYKCKEDGHYRNQCPLLKKNNIKCVFNVVFLNGKFNKTEWYVDSGASAHMTANESWKLVSGLQLGDQKPSQLLRKMRELSAGMITNEGLRIEWLNHLPTQIRVVLSVNTESSLDTLAVMADKMAEYSEPAMIAAVSTATTTTNDAVSTQIAILSKQLEKLSLEINEIRGRSTHRQYRRYRSQSRPRSNSNSTRNKSSVKPGDATWEXVGEQGPPLQPKKRQPLGGKEPNHRRRQKPPPLRSGPKWRPRRRSRGKAKIRG-