Monarch geneset OGS2.0

DPOGS211189
TranscriptDPOGS211189-TA1209 bp
ProteinDPOGS211189-PA402 aa
Genomic positionDPSCF300007 + 653576-655319
RNAseq coverage845x (Rank: top 15%)
Annotation
HeliconiusHMEL0124366e-14964.68% 
BombyxBGIBMGA003177-TA2e-16366.25% 
DrosophilaCG5114-PA1e-6837.28% 
EBI UniRef50UniRef50_Q7Q2X52e-7142.20%AGAP011387-PA n=5 Tax=Culicidae RepID=Q7Q2X5_ANOGA
NCBI RefSeqXP_393830.32e-8142.14%PREDICTED: similar to angio-associated, migratory cell protein [Apis mellifera]
NCBI nr blastpgi|3800159579e-8142.62%PREDICTED: angio-associated migratory cell protein-like [Apis florea]
NCBI nr blastxgi|3504078433e-8341.67%PREDICTED: angio-associated migratory cell protein-like [Bombus impatiens]
Group
Gene OntologyGO:00055156.5e-59protein binding
KEGG pathway 
InterPro domain[14-396] IPR0110466.5e-59WD40 repeat-like-containing domain
[177-395] IPR0159433.6e-37WD40/YVTN repeat-like-containing domain
[60-95] IPR0197817.4e-08WD40 repeat, subgroup
[56-95] IPR0016806.6e-07WD40 repeat
Orthology groupMCL11907 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211189-TA
ATGAGAGGACTATCAGAAGATACACCGCCATCCTCTATAAATGGCGATGAAATCACCGGGATGGATGACGACGGGATATATTTCGAGGAGATGGAAGAAATACAATTTGATGATGAGGAAATGGTTGATGACCAAGAGGAGGAGGATACGGAAATGATTAGACCAGAAGACCATGCCATTACAGTGTTTAACAAACATAACGGTTCAGTTTTTTGTTGTGACTTTCATCCAAATGGTAAGATTGCTGTAACCGGTGGGGAAGATGACAAAGCATACGTGTGGTCTACAGAAACAGGGGATGTATTAATGGATTGCATAGGACATAAGGATTCAGTTATCTTCGTCGGATTCAGCTTTGATGGGACCTTTCTAGCTACAGTTGATATGTGCGGATTAATAAAAGTATGGAAGGTTAATTTGGAAGAAAATCAACAAGAACCATGGTCTGTGGTATTTGAATATGAGGCGGATGATCTAAGTTGGGGTTCATGGCATTTTGGAGCTAGAGTACTCATATGCGGGGCAGTTACAGGTGATATATACATTTTCAAAATTCCATCTGGAGATACCAAGGTTCTACAAGGGCACAACATCAGAACTGAATGTGGGAAGATGTTCCATGATGGTGTCCGTCTTGCAGCTGGTTATGAAGATGGTACTGTGAAGGTTTGGGATCTTAAAACGGCCACTGTTGTATCACAGATACCACCTGGAATCCATCAGATAAGAGTCACAGCTGTAGATACACATCCAGATAACAGTCTCATGTTATCTATAGCTACTGATGGTAAAGCAGTCATGACAACATCAAGTAACGGCAAAGTCGTTGCACAGATGGAAGCAGAAAATGATTTAGAAGTTGTTGCTTTTTCACCGGACCCACAGCTAGGATATTTTGCCTTAGGTACTCTTAATGGCTCGGTGACGATATGGGACACGGCGCGGCAAATGTTACGGCACCATTGCGCCAAATCGCAGGAGTCCGACGGCGTCACTAAAATGCTATGGATCAAAGACGAAGTAGTTACTGGCTGTCTGGACGGATCCGTGCGCGTGTACGAAGCGCGATCCGGCAATCGACGACTCGTCCTCACAGGACACTGGTCGGAAATACTGGATCTTACTTACAATGAAAAAGAAAAACTCATACTGACAACCTCAGATGACGGCACCGCGAGGATATTCAAATACAACGAAAAAACCGAATAG

Protein sequence:

>DPOGS211189-PA
MRGLSEDTPPSSINGDEITGMDDDGIYFEEMEEIQFDDEEMVDDQEEEDTEMIRPEDHAITVFNKHNGSVFCCDFHPNGKIAVTGGEDDKAYVWSTETGDVLMDCIGHKDSVIFVGFSFDGTFLATVDMCGLIKVWKVNLEENQQEPWSVVFEYEADDLSWGSWHFGARVLICGAVTGDIYIFKIPSGDTKVLQGHNIRTECGKMFHDGVRLAAGYEDGTVKVWDLKTATVVSQIPPGIHQIRVTAVDTHPDNSLMLSIATDGKAVMTTSSNGKVVAQMEAENDLEVVAFSPDPQLGYFALGTLNGSVTIWDTARQMLRHHCAKSQESDGVTKMLWIKDEVVTGCLDGSVRVYEARSGNRRLVLTGHWSEILDLTYNEKEKLILTTSDDGTARIFKYNEKTE-