Monarch geneset OGS2.0

DPOGS202657
Transcript        DPOGS202657-TA   4515 bp
Protein           DPOGS202657-PA   1504 aa
Genomic position  DPSCF300039 - 124255-133544
RNAseq coverage   415x (Rank: top 29%)
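
The lengths listed for this gene can be cross-checked with simple arithmetic. The sketch below is only a sanity check on the numbers copied from this record. It assumes the transcript is entirely coding sequence (no UTRs) and that the scaffold coordinates are 1-based and inclusive; neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the record above.
transcript_bp = 4515
protein_aa = 1504
locus_start, locus_end = 124255, 133544

# Assumption: the transcript is pure CDS, so it should hold one codon per
# residue plus one stop codon.
codons, remainder = divmod(transcript_bp, 3)
cds_is_consistent = remainder == 0 and codons == protein_aa + 1

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
locus_span_bp = locus_end - locus_start + 1

print(cds_is_consistent)  # True
print(locus_span_bp)      # 9290
```

Under these assumptions the gene spans 9290 bp of the scaffold while the transcript is 4515 bp, so the difference would be accounted for by introns.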
Annotation
Heliconius      HMEL002229        3e-154  46.65%
Bombyx          BGIBMGA000855-TA  0.0     72.84%
Drosophila      CG43154-PC        0.0     46.35%
EBI UniRef50    UniRef50_E2BXX0   0.0     54.63%  Breast carcinoma-amplified sequence 3 n=3 Tax=Neoptera RepID=E2BXX0_HARSA
NCBI RefSeq     XP_001603193.1    0.0     54.60%  PREDICTED: similar to breast carcinoma amplified sequence [Nasonia vitripennis]
NCBI nr blastp  gi|350398685      0.0     54.14%  PREDICTED: hypothetical protein LOC100746524 [Bombus impatiens]
NCBI nr blastx  gi|383864067      0.0     36.57%  PREDICTED: uncharacterized protein LOC100877372 [Megachile rotundata]
Group
Gene Ontology    GO:0005515  1.4e-07  protein binding
KEGG pathway
InterPro domain  [536-666]  IPR022175  7.3e-24  Breast carcinoma amplified sequence 3
                 [145-439]  IPR011046  1.4e-07  WD40 repeat-like-containing domain
Orthology group  MCL13118  Single-copy universal gene

Nucleotide sequence:

>DPOGS202657-TA
ATGTCGGCGGAGTCTCCGCGGCACGCGCGCTCGGGTGGACCTCTCACGGTGCCCTCCCAGCCGCCCAGCGACCGCAGCATCATTGACGCCGTGTCCGGCTTCATTAATGACGTCACACTCTCCTCTTCATCCACTGTTGATCCAAAGGACGTTATACAGTGGGCTAGGTTTGAAACAGCAGATATAAATGAACCTACACAAGAAGGGGATGGTGATAATGATGTTCCACCATTACTGCTCATCCTGGGCTATGGGTCAGGAGTGCAGGTCTGGTTGATTCCCTCCAATGGGGAGGCGCAGGAGGTACTGTCGTGGAGACAAGGCACTGTGAGGGTGCTCCGTATACTGCCAACTCCGCAGCACGGCGACTGCTTTGCATCGAAAAGGCCCCTCATAGCTTTGTGTGATTCTGCCAGTCCAGGACCAGCCTTTTGTTCATTGATCTTCTTATCCATTAGGGGTGGGGAGCAGGTGAAGAGTATTAAGTTCAAGAACCCCATCCTGGATGTGTTGGCCAATAAGCGTTCAGTGGTTGTGTCATTCTCTGAACGTTTTGCTGTTTTTGATGCTGCTACTTTAGAGGACCGGCTGGCTGTCACCACATGTTATCCGTGCCCATGTCCACTAGGAGGGAGCGCTCCTATCAACCCTTTGACCCTCGGGGACCGCTGGCTGGCTTATGCTGAGAAGAAACTCAACCCATCAAAACGTAGCAGCGGAGGATGTGAAACTGAAGGGGTAACGAGTTATACGGCTACTGTACTCCACGCGGCTAAATCTCTCAGTAAGGGTCTGCGAGGCCTGGGCGAGACGGTGGCGCATAGTTTGGCGGGCGGTCGCAGCACGTCGCAGTCACCGTCACCACCACACGCTGATATACAGCAGCCGGGGGTCGTCACTATATTGGACATCGAGGGTAATGAAGATGAAGATAGTCAAGACTGCGAGGAGCCCTGCGACCCTATAGTGGCTCACTTCATCGCTCACTCGGAGGCGATTATAGCTCTGAAGTTCGACCCCAGCGGCATGTTGCTGGTGACCGCCGATCGGAGAGGTCACGACTTCCACGTGTTCCGTATAAACCCCCACCCCTGCGGACCCAGCCTGGCCTCCGTGCACCACCTGTATATACTACACAGAGGCGATACTACGTCCAAAGTACAGGATATATGTATATCCGGTGACTCCCGTTGGGCGGCCATATCGACCCTCCGGGGTACGACGCACGTGTTCGCGATCAGTCCCTACGGCGGGGCGATCGGCGTCCGCACTCACACCCAGCCGCGGCTGGTGAACAGACTGAGCCGGTTCCATCGCTCGGCCGGCCTGCCCATACATCATACATCCCACGTCCCGCCCGCCGCTCACAGCCCAGTTCTAGAGTCCGGGGCGTGGTTTCCCAACCCGCGTCTCCCGCCGTACCCGCAGCCCGCGACTTGTTCGCCCCTGGCGCAGCTGAGACCCACACACCTGCCCACCACCACCATCACAAGGAACAGCTCGGGTCGTCAGCGTCTGTCATCGTTGTCTGAAGAGGGCGGCGCGGCGCCCCTGTTAGCGCGGGCCTGTTTCGGTGTGAGCGGCTCGACTGGCAGGGCCGCCTCTGTGCCGCTCTACCTTGCAGCAGCTAACGGGGCATTGCTGCATCTGGCTCTACATCCCAAACCGGCCCGCAGTGTTCCCAAGGAGAAGATATGTGACGAGTCTCCCATCGAGCTGGAGGTTGAGGCGGTGTCACAGTGGCCTCTACAGCGTCCTGCCGCCGCCTCCGACCTCCTCGCCCCACTCCCTCCCTCCAACCCCCTCTTACAACCCATGAGCTGTAGGCGTTGCGCGGACATATGTATGAGTGAAGAGGAGCGCTGGCTGTCTCAGGTTGAGATCGTGACGCACGCCGGCCCGCATCGCAGACTGTGGATGGGACCGCAGTTCGTCTTCAAGACATACAACTGTACGGGGTCTACGTCATCACTATCAGAGGCAGAGGCGGTGGAGGTGGA
CGCCAGCGCAGCCCCGGCCCGCTCCAACCCCGTCAACATGCCAGGGGCGAGGCCAGCTGTGCCCGTCCTAATAGACTCCGGATCAGCCAGTTCCCTGGAACACTCTCCGTCTGACAGTTTCCGTCGCAAGTCGCTGCTGGAGCCGGGGCGAGTGTGCGACGTCCAGCTCAGAGAGGACCTCGCTGAGGCCATGAAGGAGGATCACGGGTTGCCGCGGGTGGAGCGCGCCTGCAGTGTGGAGCGCGGGGGAGCGGTCGTGGCCCGAGATGTCGGGCCGACCGGGGCCGTGGCCGCGCATCGCGAGGAGCCGCACGCGCCAGCCCTGGACGCGGACGCGCACACGACCTGCAATACTGACGAAGCGGCCTTCCGACCTGTAGTGCGTGCGCCGGCGACCCTGAGCCCGCTGACACCGGCGCTCTCAGCTCGAGAGCTTCCTTGTTGTACCACCATCCCAGCGCAGCCCGCGCCTCCGCGGCGAGCCTCGCCGGACGACAACCCGTTGCCTCTCACCACAGACGTCGTCATACCGGCCGAACTGACCGACGGTCGCCTCGAGTTCACTCATCTGCCGGCCGCGGAGCCCATTACTGATACGATCGGCGGGTTTGATTCCTTCGCAGACTTAGATGTTAAGAATACATTTGTACATAGAGATCTCGATTACAGTGAGCGAATGGAGAGTGAGAGAATGAACGATAGAGCCCCGGCACCGGACAAGAGTGATGAGGTGCCGAGTTCACTGCCAAAACCGAAACGGCCATCGGATGATATTCAACCGTCGGCTCGACCTAGAGTTAAGAAATCTCCCACCACAAAAACGCATAGTGATAGAGCAGCTAGTGATATAGATAAATTATGTGTGAATGATAAAGATTTTATGAATATACACAATGATGATATGTCTTCTAAACTCCGAGCAGCGGAGAAAGAAATTAAACCTCAGAAAGGAATGAAAGTGTCTAAAGGAATTGAAAAGCAGAATCAACAGGGTATTTCAAAAAACGATAAAGATATAACCCAGTCGCCAGAACCTAATGTGAGAGAGACCACCACGAAGTATGAAAAACATTCAGAAACGATTAGAACAGAAGATCAAGCTTGGGATATGCTTTTAAACGATACACAACAAACTTCTAAGAAAGACGTAAATCTTGTTACCGCAAACAAAGTGGAGATAAAAGATGACGTGAAAGCAAAAACTAAGAAAAGTCGTAAATCTAAAAAATCTATAGAAGATCAACAAGCCAAGGACGACGAAGACAGCTTTATAGAAATACATAATATAGAAGAGAAGCAAACGACCAGCGGAGATCTAGTTTCTATATCAATGCCTTTTGAGGACATTGAATCTTCTTACTTGCCGAAATCTAAGAGACGTTCAAAGAGCCGCACACCCGAGAGAAAAGATGTCGCCGAAAATAAACAAGAAAATATCAACGAACAAGATTTTGATATTCCAAACATTAGTAAGAAAAATAATAAAATGAATGAAATATTGACTACAGAATCTAAAACACAGCCGATTTCTTTGTCTACTACAAAGGATATAGATAGTAAACAGAAAGAATTATTAAATGTTGATGCTAAAACTGGTAACGAACCTAAGGCTGGCAAATCACTATCGACCAAAGAATCACCAAAATTAACAAAGCGCAAGTCCCCGTCACCGAAAGTTGATAGAAAAGAAGAAAACAAAGCTGACGATAAAGAAGTTTATGTCATTGAAACAACAGACGACGACTTTCCCGAAATACAAATAACGAAGGGGAATAAATCGAATAAGAGGTCTTTTCAACTTTATGAGAAGAAGAAAGAAGAAGCAGCGAAACCGGCTAAATCTTGGAGTTCCGTAGCTGCTTCTAAAAATAAAAAGGTCGACGAAGTCAAAGTTGTTACGGAAAATATTGAAGAACAAGAAACGGAAGACGAGGAAATGAAATCACCAGTATCGCTTCAAGAAAAGTTATTTGAATTGTGCAAAAGAAGAGACATAATGG
TAGCTGAGTGCGATGCTCCATCAGAACTTAATTTTGTCGAGGAACATCATGCTGTGGTAGACCTCCCTCCTTTAGAGCAACTAGATTTCGGTCTAGACAACTTCTCACTGGAGGTCATGCGGGACAGTCTTCTGGAAGTCAACGAGCCGAAGGTCTCCAGTCCGATTTGCAAAATCAACATCGATGAAATCCTGTCTTCCATCAAAGAAACGACATCGAAAGCGATCGAAACCAGTACTTTCAATCTAATTGATGTCGAAAAAGTGCCTGCGAGGAAAGAAAGGGGCTTCAATATAGTCGAAAGCGATAAAATTACGTCCCAAGAAGTCAAATTGGAGGATGAGGTCAAGTTCGAGAAGGACGAACTTGAAAAATCATCTGACGAGGAGATGGCATCACCAGTTCTGTCGACTGACAGCGATAAAGAAGAAAAAAAATCAAGCGAAAACAGTAACGCGACCCCGACAGCGAAGCAATCTAAGTCTAAAAAGTCACGTAGGAAGAAGAAATTATAG

Protein sequence:

>DPOGS202657-PA
MSAESPRHARSGGPLTVPSQPPSDRSIIDAVSGFINDVTLSSSSTVDPKDVIQWARFETADINEPTQEGDGDNDVPPLLLILGYGSGVQVWLIPSNGEAQEVLSWRQGTVRVLRILPTPQHGDCFASKRPLIALCDSASPGPAFCSLIFLSIRGGEQVKSIKFKNPILDVLANKRSVVVSFSERFAVFDAATLEDRLAVTTCYPCPCPLGGSAPINPLTLGDRWLAYAEKKLNPSKRSSGGCETEGVTSYTATVLHAAKSLSKGLRGLGETVAHSLAGGRSTSQSPSPPHADIQQPGVVTILDIEGNEDEDSQDCEEPCDPIVAHFIAHSEAIIALKFDPSGMLLVTADRRGHDFHVFRINPHPCGPSLASVHHLYILHRGDTTSKVQDICISGDSRWAAISTLRGTTHVFAISPYGGAIGVRTHTQPRLVNRLSRFHRSAGLPIHHTSHVPPAAHSPVLESGAWFPNPRLPPYPQPATCSPLAQLRPTHLPTTTITRNSSGRQRLSSLSEEGGAAPLLARACFGVSGSTGRAASVPLYLAAANGALLHLALHPKPARSVPKEKICDESPIELEVEAVSQWPLQRPAAASDLLAPLPPSNPLLQPMSCRRCADICMSEEERWLSQVEIVTHAGPHRRLWMGPQFVFKTYNCTGSTSSLSEAEAVEVDASAAPARSNPVNMPGARPAVPVLIDSGSASSLEHSPSDSFRRKSLLEPGRVCDVQLREDLAEAMKEDHGLPRVERACSVERGGAVVARDVGPTGAVAAHREEPHAPALDADAHTTCNTDEAAFRPVVRAPATLSPLTPALSARELPCCTTIPAQPAPPRRASPDDNPLPLTTDVVIPAELTDGRLEFTHLPAAEPITDTIGGFDSFADLDVKNTFVHRDLDYSERMESERMNDRAPAPDKSDEVPSSLPKPKRPSDDIQPSARPRVKKSPTTKTHSDRAASDIDKLCVNDKDFMNIHNDDMSSKLRAAEKEIKPQKGMKVSKGIEKQNQQGISKNDKDITQSPEPNVRETTTKYEKHSETIRTEDQAWDMLLNDTQQTSKKDVNLVTANKVEIKDDVKAKTKKSRKSKKSIEDQQAKDDEDSFIEIHNIEEKQTTSGDLVSISMPFEDIESSYLPKSKRRSKSRTPERKDVAENKQENINEQDFDIPNISKKNNKMNEILTTESKTQPISLSTTKDIDSKQKELLNVDAKTGNEPKAGKSLSTKESPKLTKRKSPSPKVDRKEENKADDKEVYVIETTDDDFPEIQITKGNKSNKRSFQLYEKKKEEAAKPAKSWSSVAASKNKKVDEVKVVTENIEEQETEDEEMKSPVSLQEKLFELCKRRDIMVAECDAPSELNFVEEHHAVVDLPPLEQLDFGLDNFSLEVMRDSLLEVNEPKVSSPICKINIDEILSSIKETTSKAIETSTFNLIDVEKVPARKERGFNIVESDKITSQEVKLEDEVKFEKDELEKSSDEEMASPVLSTDSDKEEKKSSENSNATPTAKQSKSKKSRRKKKL-
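
The nucleotide and protein records above can be spot-checked against each other by translating the ends of the transcript. The following is a minimal sketch, assuming the standard nuclear genetic code and a sequence containing only A, C, G and T; the `translate` helper and the variable names are illustrative and not part of the database. Only the first 30 and last 15 nucleotides of DPOGS202657-TA are embedded here to keep the example short.

```python
# Standard genetic code, laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Fragments copied from the DPOGS202657-TA sequence above.
first_30_nt = "ATGTCGGCGGAGTCTCCGCGGCACGCGCGC"
last_15_nt = "AAGAAGAAATTATAG"

print(translate(first_30_nt))  # MSAESPRHAR
print(translate(last_15_nt))   # KKKL*
```

Both fragments agree with the start (MSAESPRHAR) and end (KKKL) of DPOGS202657-PA. The protein record writes the terminal stop as "-", whereas this sketch writes it as "*".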