Monarch geneset OGS2.0

DPOGS206306
TranscriptDPOGS206306-TA3684 bp
ProteinDPOGS206306-PA1227 aa
Genomic positionDPSCF300082 - 978285-985438
RNAseq coverage3034x (Rank: top 4%)
Annotation
HeliconiusHMEL0102720.091.86% 
BombyxBGIBMGA014126-TA0.089.18% 
Drosophilaabba-PC0.065.49% 
EBI UniRef50UniRef50_D6X3080.061.51%Putative uncharacterized protein n=7 Tax=Arthropoda RepID=D6X308_TRICA
NCBI RefSeqXP_001808548.10.061.38%PREDICTED: similar to AGAP007135-PA [Tribolium castaneum]
NCBI nr blastpgi|1892410700.061.38%PREDICTED: similar to AGAP007135-PA [Tribolium castaneum]
NCBI nr blastxgi|1892410700.062.58%PREDICTED: similar to AGAP007135-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055154.6e-09protein binding
KEGG pathway 
InterPro domain[950-1097] IPR0110421.5e-36Six-bladed beta-propeller, TolB-like
[3-58] IPR0130833.8e-10Zinc finger, RING/FYVE/PHD-type
[1059-1086] IPR0012584.6e-09NHL repeat
Orthology groupMCL16034 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206306-TA
ATGGAGCAGTTCGAATCACTGCTAACTTGCTGCGTTTGTTTGGATCGATATAGGAATCCAAAGCTGTTACCATGCCAGCACAGCTTTTGTATGGAGCCATGTATGGACGGCCTTGTCGATTATGTCCGGAGACAGGTAAAATGTCCAGAATGTCGTGCAGAGCATCGTATTCCTTATCAAGGAGTTCAAGGATTTCCAACAAATGTTACTTTACAAAGATTTTTGGAACTCCATGCTCAAATTGCTGGAGAACTACCCGATCCAACAGCAGGGCAAGTTATGGAAAGATGCAATGTTTGCTCAGAAAAGGCATATTGTGCGCCCTGTGCACATTGCGACAAAAAAGTTTGTGAAGATTGCAAATCTGCTCATATGGAAGTATTACGAAGAGAAATAGCTAGAATCAACAATCAAATTCGGCGTGGTGTAAACCGATTGCAAGATATTTTGGCTGTAGTTGAACGAAATACAGCAAATCTTCAAACAAACTGTGGTGCAGTAGCTGGAGAAATCGATGAAATTCATAAAAGACTTGCAAAAGCTCTCAAAGACCGAACTGATTTTTTAAGGACAGAAGTTGACCGATATCTTGCCACTGAACTTAGAAATCTAACGCATCTTAAAGATAATTTAGAGTTAGAACTAAGCAATATTCAGAGTAATTGTGATCTAGCTGATAAATATATGAATGATGACGTCGATTGGGAGGATACCGAATTAGTTGATACGAAAGAAATTTTTCTGAAAACGGTAGAATTTCTGAGAAACTTCGACTATGAAGCTGGTGATTACAATCGCAGAGTTCGTTTCATAATGACTCATGATCCAAACCAACTAGTTATGCATGTTGCTAGTTATGGAGAACTTAACATAACAAATCCAAATGCGTATTCAGCTGGCTTACAACAATCTCAAGGTCTTACTAGATCTAAAAGTGATCATCGTTTAGCGACACAATTCCGTCAACAAGAAGAAGCCAAGGGTTATATGGAAAATGATGAACCCATATTAGGTGGACGAAAATTTGGAGAACGTCGGCCTCCTCCACCGGAAAAGCACACGCGAGACTATAGTGCTACGGATGATTACAGCGGTTATGAATCAGAACACAGACCATCTCGTCGTTTCCGTTCACGTTTCGTAAGAAGTCATCAACAAGATAACGATTCTGATACCGAACAAAGCACGAGGACTGTAAAATCTGAACACAAGGAAAAGGAAAAGGAAAAAGTTGCAGATACCGAAGATGCAACACGTGGTCCACTAAGTGGAATATTTAGGCTTAGCGATTGCCCACGTGTAATACAACGGATACTTGACGTAGACAGTGGAAAGAAAAAAGAAAAAAAGGAGCCACCTCCGCCGCCAAAACCAGTGCAACCAACCCCACAACCTCAACGTCGGCCACCACCAGTAAGACAACAGAGTGAAGATGACGAAATCTCAAGATTAAAGAGACAAAATAAAGGTGCAGCGTCATCACAAGAACCAGAACGCCTTCCTTCACGACCTGTTGAAGAAGAGAGACCCGCTGTTAATCGTAAACCACCTACTCCAGCACGCGAGGCATCCAGTGAAGGTGAATCCGATGAAGAATCGGTTGGATCTCTTCAAAGAAATCAGCGTAAGAGCTCAGTACAAACACAAAAACCAACTGCTACGAGAAGGCCTTCTGCGTCTGATACATCTTCCACTCATCGACCCGCGGCTCGTGCTACTAGTACTGAATCTAGTGCATCTACCGAAAGTTCCGGCTCGGCGGTAAAACATACAGGTGCAATTCTATCAATCGCGGAGTTGAAAGCCAAGTACAGCATTGATGGGCCTTCTCCAAAACCGATCTCTCGTATTTTGTCCGCAGGCAATGAGAGATCTGCTCCAGTGACAACAAATGGAACAGTTAGCGGCGCCCAACGAGTTCAAAGTCGATTCGTTGGATCCCAACGACCGACACCTGCCCCGGCGCCCGCCGAGCCGGCTCCCGAAGACTCCGATACAAGTTCTGAAGAGGAAACCGACTCTTCTGAGGAGAGCGAGGAGGAGCCTGTAACGCAAAGGAAGCCCGAGAGTCAGGCGATGGCGCGTAGTGACATTGGCCCCCTTCTAGCAAGAAGTACCAATGCTCGAAATGATGCCCATGATAACAAAAGTAAAGAAACACCAGCTCAAACGCGATACCGAACTCGCCAATCATCGCAAACGGAAGAGGAGCCGTCTCCTCGATACGGTGCCAGCAGTTCATCATATAGTAGTCGCTATGGTAGTAAGCCTAAAGACGAGGAATTAACATCTTCTTTAGATGATGAATCAAAATACCCCACGGCTCGGTCCAGGTACCTTGCCTTAAAAGAACGACGCAATCGCCTTGCACGAAGTAAAAGTAGCCATACTGGCTTCGGAGCAGGTGACGATGATGACCAAGATGAACCAGTTTCTCCTACCACTGCTTCACCATCAGCTTATCTTGCTGCTCGATACGGCTCCGGCACTGGAGGTTCGGAGTTGTCTCGTAGCCGTTCTTCTCATGCGCTCAAGTCTAGAGAAAGCTCCCCCGAACGTCCAGTTACTGGAGAGAAGGACGGAGCAGCCTTAAGCTCTTGGGCGAGATACTTAAAAAATAAATATGGTTCGCGAGGTAAAGATCGTGATACAAGCGGGACTTCCTCGAGTTCATCTCGTCGCCTGTCTCTCGGGTTGCCATTACGCTCAGCTAATGAGCTTGCCAGTTCTGATGACGATTCAAAAAACGCGGCAGGCTCCCCCATCTCCCCTACGGCGGCTACAGCAGCGGTAGCAGGTTTCGCAGCAGCAGGTTCCTCCCCTAGGAGCCAGTATCTGCAGAAGCGCCGTTTACAATTCAGTGTTGGGAGTCGGGGGAGCGAACCCGGTTGTTTCACTTGGCCACGTGGTATCGCTGTAGGACCTGAAAATATAATGGTCGTGGCTGATTCATCTAATCATCGTGTTCAAGTATTTGATTCAAATGGTATATTTATAAAAGAGTTCGGTCAGTACGGTAGTGGAGAGGGGGAGTTTGACTGTCTGGCTGGCGTCGCCGTCAATCGTATTGGACAATATATCATAGCTGACAGATACAATCACAGAATACAGGTATTCGACCCAGCCGGTCGGTTTTTGAGGTCATTCGGTAGTCAAGGAACTGGTGATGGCAAGTTCAATTATCCCTGGGGTATCACCACAGATGCACTCGGATTTATATATGTCTGTGACAAGGAAAACCATAGAGTACAGGTATTCCAATCCGATGGCACATTTGTGGGCAAATTTGGTAGTTTCGGCTCGAAGTTAGGACAACTGGAACACCCTCACTATATAGCTGTTTCAAGCACAAATCGTGTTCTTGTGTCTGACTCTAACAACCACAGGATCCAAGTCTTCGATGTGAATGGACGTGTCCTTTCTTCATTTGGAGAAGAAGGCTCTGAAGATGGACAGTTTAAATTCCCAAGGGGTGTGGCAGTGGATGACCAAGGTTACATAGTTGTAGCAGATTCTGGAAACAACAGAATTCAGATCTTTCACCCTGACGGGACGTTCCTTAGAGCTTTTGGGTCCTGGGGTTGCGGTGATGGGGAATTCAAAGGGCTGGAGGGTATTGCCGTCATGTCTGGTGGAAACATCATTGTTTGCGATCGGGAGAATCATAGGGTGCAAGTTTTTTAA

Protein sequence:

>DPOGS206306-PA
MEQFESLLTCCVCLDRYRNPKLLPCQHSFCMEPCMDGLVDYVRRQVKCPECRAEHRIPYQGVQGFPTNVTLQRFLELHAQIAGELPDPTAGQVMERCNVCSEKAYCAPCAHCDKKVCEDCKSAHMEVLRREIARINNQIRRGVNRLQDILAVVERNTANLQTNCGAVAGEIDEIHKRLAKALKDRTDFLRTEVDRYLATELRNLTHLKDNLELELSNIQSNCDLADKYMNDDVDWEDTELVDTKEIFLKTVEFLRNFDYEAGDYNRRVRFIMTHDPNQLVMHVASYGELNITNPNAYSAGLQQSQGLTRSKSDHRLATQFRQQEEAKGYMENDEPILGGRKFGERRPPPPEKHTRDYSATDDYSGYESEHRPSRRFRSRFVRSHQQDNDSDTEQSTRTVKSEHKEKEKEKVADTEDATRGPLSGIFRLSDCPRVIQRILDVDSGKKKEKKEPPPPPKPVQPTPQPQRRPPPVRQQSEDDEISRLKRQNKGAASSQEPERLPSRPVEEERPAVNRKPPTPAREASSEGESDEESVGSLQRNQRKSSVQTQKPTATRRPSASDTSSTHRPAARATSTESSASTESSGSAVKHTGAILSIAELKAKYSIDGPSPKPISRILSAGNERSAPVTTNGTVSGAQRVQSRFVGSQRPTPAPAPAEPAPEDSDTSSEEETDSSEESEEEPVTQRKPESQAMARSDIGPLLARSTNARNDAHDNKSKETPAQTRYRTRQSSQTEEEPSPRYGASSSSYSSRYGSKPKDEELTSSLDDESKYPTARSRYLALKERRNRLARSKSSHTGFGAGDDDDQDEPVSPTTASPSAYLAARYGSGTGGSELSRSRSSHALKSRESSPERPVTGEKDGAALSSWARYLKNKYGSRGKDRDTSGTSSSSSRRLSLGLPLRSANELASSDDDSKNAAGSPISPTAATAAVAGFAAAGSSPRSQYLQKRRLQFSVGSRGSEPGCFTWPRGIAVGPENIMVVADSSNHRVQVFDSNGIFIKEFGQYGSGEGEFDCLAGVAVNRIGQYIIADRYNHRIQVFDPAGRFLRSFGSQGTGDGKFNYPWGITTDALGFIYVCDKENHRVQVFQSDGTFVGKFGSFGSKLGQLEHPHYIAVSSTNRVLVSDSNNHRIQVFDVNGRVLSSFGEEGSEDGQFKFPRGVAVDDQGYIVVADSGNNRIQIFHPDGTFLRAFGSWGCGDGEFKGLEGIAVMSGGNIIVCDRENHRVQVF-