Monarch geneset OGS2.0

DPOGS213250
TranscriptDPOGS213250-TA7743 bp
ProteinDPOGS213250-PA2580 aa
Genomic positionDPSCF300124 + 289318-301523
RNAseq coverage782x (Rank: top 16%)
Annotation
HeliconiusHMEL0078550.062.32% 
BombyxBGIBMGA009443-TA0.075.34% 
Drosophilazye-PA0.040.59% 
EBI UniRef50UniRef50_E0VCT40.050.33%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VCT4_PEDHC
NCBI RefSeqXP_002423928.10.050.33%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|2420061710.050.33%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastxgi|1951278900.042.11%GI11840 [Drosophila mojavensis]
Group
KEGG pathwayrno:3030024e-08 
 K12485 (RAB11FIP3_4)maps-> Endocytosis
InterPro domain[91-350] IPR0015073.4e-33Zona pellucida sperm-binding protein
Orthology groupMCL18881 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213250-TA
ATGACCAATGTCGATTCTGTGGTAATTATTGTTATGCAAGAGCCCCATGAAGCGGCTGTCATGAAAATAGCGACATTGATTGCACTTGTGCAGCTGTCGCTTGCTACAGCTGTTTCAGATGGCACGCTCACCGCAGCTGAGTTAAATAAAGAATTGGCTGGTGATAATAGTCTAAGTCCATTCTTTGATGAAATCATCGATGACACGGCGGTGAATTTCGTCCGGAGCTCCAGGGCTGTCGACAGTCCTCTATCTCCGGATATCAATGTACAATGCTCAGCTGACTTCATTGATGTGACAGTCGAATTCTCTGATGTGTACGACGGCATTATCTACAGCAAAGGCTACTTAAATGATCCGAAATGCAAATACGTATCTCTCGGCGGAAGTCAGTCACGGTACTCATTCCGAGTACCGCTGAAGGGCTGCGGCAGCAGATCTCTTTGCAACGCTTGCGGCACCATTGATAATGTCCTCGTGTTCCAAGCCGACGACTTCGTCCAAGGTCCCTATGATTTCGCACGAAAAGTTTCATGCGCAAGTACAGCTTTAGAGGTATCTGTCGGAGGAGTAAGGAAAGAGCAATCGCATATTTTGAAACTCAAGCCTTTTATGGTTGACATGCTCGATGTAGTAGCTGTGCAAGGGCCCGCAGGCGGCGTCGAATGTTGGATGGACATACAGAAGGGAGTATTCCCTAATACTACTCCTTTAGAACACTCGATCAAGATCGGCGAATACCTCACGATATTAATATACCTAAAAGATACTAGGAATCAGTTCAACCTAAAAATACACGATTGTTGGGCATATGATAATGAAGACTATGATAACCCGAATACTAATAAGATTCAGTTAACAGACAAGGAAGGATGTCCAAAGAAAAGGAAATTTATTGATTTATTCCAAAAATCCACAAACACGGGTAAATCCGGGGCGACACTGATAGCCTATAGCAAAGTCAGCGCCTTCCGCTTCCCTGAAACGGATCAAGTATATCTAACGTGTAATGTTGAGCTGTGCAAGAGTGATTGTGATTCAAGCTGCAGAGACATTATCAAACCAATAACGACCACAAAAAAACCGCAAATAATACCTTCCTGTTACCCTGGCAGCACTGACCCACGTTGTCCACGACCAACTACTCCTGAAGCCCCAAGATGTTATCCAGGAAGTACTGATCCACGTTGCCCACAACCTACTACACCGGCTCCACCTAAATGCTTCCCTGGATCCACAGACCCACGTTGCCCAAGGCCAACAACTCCTGAAGCACCTAGATGCTACCCCGGAAGCAATGACCCACGCTGCCCAAAACCAACTACACCGGAGTCACCTCGCTGCTACCCAGGAAGCAGTGACTCACGTTGTCCAAGACCAACTACTCCTGAGGCCCCCAGATGTTATCCAGGAAGTACTGATACACGTTGCCCACAACCTACTACACCGGTTCCACCTAAATGCTTCCCTGGATCCACAGACCCACGTTGCCCAAGACCAACTTCACCAGAGTCGCCTCGTTGCTACCCAGGTAGCACTGACCCACGTTGTCCACAACCAACAACTCCTGAAGCACCTAGCTGCTACCCCGGAAGTAGTGACCCACGTTGTCTAAGACCAACGACTCCTGAAGCACCGAGATGCTATCCCGGCAGTGACGACCCACGTTGTCCAAAACCTACTACACCGGCTCCCCCCAAATGCTTCCCCGGATCCACTGATCCACGCTGTCCAAGGCCGACGACTCCCGAAGCACCAAGATGCTACCCTGGAAGCAATGACCCTCGTTGCCCGAAACCAACTACACCAGAATCACCTCGCTGCTACCCAGGAAGTAGTGACCCACGTTGTCCAAGACCAACGACTCCTGAGGCACCAAGATGCTACCCCGGAAGCAGTGACCCTCGTTGTCCCAAACCTACTACACCAGCCCCACCAAGATGTTTCCCAGGATCCACTGATCCACGTTGCCCACAACCAACAACTCCTGAGGCACCAAGATGCTACCCCGGAAGCAGTGACCCTCGTTGTCCCAAACCTACTACACCAGCCCCACCAAGATGTTTCCCAGGATCCACTGATCCACGTTGCCCACAACCTACTACTCCTGAGGCACCTAGATGCTACCCTGGAAGCAATGACCCTCGTTGCCCTAAACCCACTACACCAGCCCCATCAAAATGCTTCCCGGGATCAAGTGACCCACGTTGTCCGGGCTGCGGCAGCAGATCTCTTTGCAACGCTTGCGGCACCATTGATAATGTCCTCGTGTTCCAAGCCGACGACTTCGTCCAAGGTCCCTATGATTTCGCACGAAAAGTTTCATGCGCAAGTACAGCTTTAGAGGTATCTGTCGGAGGAGTAAGGAAAGAGCAATCGCATATTTTGAAACTCAAGCCTTTTATGGTTGACATGCTCGATGTAGTAGCTGTGCAAGGGCCCGCAGGCGGCGTCGAATGTTGGATGGACATACAGAAGGGAGTATTCCCTAATACTACTCCTTTAGAACACTCGATCAAGATCGGCGAATACCTCACGATATTAATATACCTAAAAGATACTAGGAATCAGTTCAACCTAAAAATACACGATTGTTGGGCATATGATAATGAAGACTATGATAACCCGAATACTAATAAGATTCAGTTAACAGACAAGGAAGGATGTCCAAAGAAAAGGAAATTTATTGATTTATTCCAAAAATCCACAAACACGGGTAAATCCGGGGCGACACTGATAGCCTATAGCAAAGTCAGCGCCTTCCGCTTCCCTGAAACGGATCAAGTATATCTAACGTGTAATGTTGAGCTGTGCAAGAGTGATTGTGATTCAAGCTGCAGAGACATTATCAAACCAATAACGACCACAAAACCGCAAATAATACCTTCCTGTTACCCTGGCAGCACTGACCCACGTTGTCCACGACCAACTACTACTGAAGCCCCAAGATGTTATCCAGGAAGTACTGATTCACGTTGCCCACAACCGACTACACCAGGGTCACCTCGATGCTACCCTGGAAGCAATGACCCTCGTTGCCCGAAACCAACTACACCAGAATCACCTCGCTGCTACCCGGGAAGTAGTGACCCACGTTGCCCACGACCAACAACTCCTGAAGCACCTCGATGCTATCCCGGAAGTACTGACTCACGTTGTCCCAAACCCACAACACTACCACCACCAAGATGTTTCCCAGGGAGCGACGATCCTCGCTGTCCCAAAGTAACAACACCAGTTCCAACAAAACCCTCTTGTTACCCAGGCTCACCTGATCCTAACTGTCCACAACCGTCGAGGCCAACAACGCTTAATCCACCTACTTATTTACCTCCTTCAACACCAGGACTACCAAAATGCTTCCCCGGCAGCAATGACCCACGTTGTCCTAAACCCACTACACCAGCCCCACCAAGATGTTTCCCAGGATCCACTGATCCACGTTGCCCACAACCTACTACTCCTGAGGCACCTAGATGCTACCCTGGAAGCAATGACCCTCGTTGCCCTAAACCCACTACACCAGCCCCACCGAAATGCTTCCCCGGTAGCAATGACCCTCGTTGCCCCAAGTTAACTACACCAGAATCACCTCGCTGCTACCCAGGAAGTAGTGACCCACGTTGTCCAAGACCAACGACTCCTGAGGCACCAAGATGCTACCCCGGAAGCAGTGACCCTCGTTGTCCCAAACCTACTACACCAGCCTCACCAAGATGTTTCCCAGGATCCACTGATCCACGTTGCCCACGACCAACGACTCCTGAGGCACCTAGATGCTACCCTGGAAGCAATGACCCTCGTTGCCCTAAACCCACTACACCAGCCCCATCAAAATGCTTCCCGGGATCAAGTGACCCACGTTGTCCGCGACCAACTACTCCTGAAACCCCAAGATGTTATCCAGGAAGTACTGATTCACGTTGCCCACAACCGACTACACCAGGGTCACCTCGATGCTACCCTGGAAGCAATGACCCTCGTTGCCCTAAACCCACTACACCAGCCCCACCAAAATGCTTCCCGGGATCAAGTGACCCACGTTGTCCACGACCAACTACTACTGAAGCCCCAAGATGTTATCCAGGAAGTACTGATTCACGTTGCCCACAACCGACTACACCAGGGTCACCTCGATGCTACCCTGGAAGCAATGACCCTCGTTGCCCGAAACCAACTACACCAGAATCACCTCGCTGCTACCCGGGAAGTAGTGACCCACGTTGCCCACGACCAACAACTCCTGAAGCACCTCGATGCTATCCCGGAAGTACTGACTCACGTTGTCCCAAACCCACAACACTACCACCACCAAGATGTTTCCCAGGGAGCGACGATCCTCGCTGTCCCAAAGTAACAACACCAGTTCCAACAAAACCCTCTTGTTACCCTGGCTCACCTGATCCTAACTGTCCACAACCGTCGAGGCCAACAACGCTTATTCCACCTACTTATTTACCTCCTTCAACACCAGCACCACCAAAATGCTACCCTGGTAGCAATGACTCTCGCTGTCCAAGACCAACGACTCCTGAAGCACCCAGATGTTACCCCGGTAGCAATGACCCTCGTTGCCCTAAACCCACTACACCAGCCCCACCGAAATGCTTCCCCGGTAGCAATGACCCTCGTTGCCCGAAGTCAACAACACCAGAATCACCTCGCTGCTATCCCGGAAGTAGTGACCCACGTTGTCCAAGACCAACGACTCCTGAGGCACCAAGATGCTACCCCGGAAGCAGTGACCCTCGTTGTCCCAAACCTACTACACCAGCCTCACCAAGATGTTTCCCAGGATCCACTGATCCACGTTGCCCACGACCAACGACTCCTGAGGCACCTAGATGCTACCCTGGAAGCAATGACCCTCGTTGCCCTAAACCCACTACACCAGCCCCATCAAAATGCTTCCCGGGATCAAGTGACCCACGTTGTCCGCGACCAACTACTCCTGAAACCCCAAGATGTTATCCAGGAAGTACTGATTCACGTTGCCCACAACCGACTACACCAGAGTCACCTCGATGCTACCCTGGAAGCAATGACCCTCGTTGTCCAAAACCCACTACACCTACTCCACCTAATTGCTTCCCGGGATCCACTGATCCACGTTGTCCACAACCCACGACCCCCGACGTACCTAGATGTTATCCCGGAAATACTGACCCACGTTGTCCTAAATCCACTACCCCAGCACCACCAAGATGTTTCCCAGGGAGCACTGATCCTCGCTGTCCTAAAGTACCAACACCACTACCAACGAAACCCTCTTGTTACCCAGGCTCACCAGATCCTAACTGTCCACAACCGTCGAGGCCAACAACACTTAATCCGCCTACTTATTTACCTCCTTCAACACCAGCACCACCAAAATGCTACCCTGGTAGCAATGACTCACGTTGTCCCAGCAGACCAACAACTCCGGAAGAAGCACCTAGATGTTACCCTGGAAGCAATAATCCTCGTTGCCCGAAACCAACTACATCAGAATCACCTCGCTGCTACCCAGGAAGTAGCGACCCACGTTGTCCACGACCAACAACTCCTCTGAAGCACCTCGAGATATGCTACCCCCCGGAAGCAATGACGCTCGTTGTCCTAAACCCACTTCACCAGCCCCACCTAAATGCTTCCCAGGATCCTCTAATCCACGTTGCCCACAACCGACTACACCAGAGCCGCCAAGGCTCACTAGATCCTAACTGTCCACAACCGTCTAGACCAACAACTCTTAATCCACCTACTTACTTACCTCCTTCTTCAACAATTCCACCAAAATGTTTCCCAGGAAGTTCGGATCCTCGTTGTCCTCGACCCACAACTCCTGATACTCCCAGATGCTATCCAGGAAGTACAGATTCTCGTTGCCCAAAACCTTCAAGTCTTGGACCACCGAAATGTTTCCCAGGAAATTCGGATCCTCGTTGTCCACGACCAACAAGCCCGCAACCAAGCTCGACGCCCAGTTCATGTTACCCTGGGTCAAAGGATCCAAAATGTCCTCAACCATTCGCGCCCAGTAGCACAAATCCCCCAGTCACCTACTTACCACCATTTCCTGCAGATAACGAAATTAGCTCTGGAAGATTAGCCAATAAGAATTTCGATTATTACGATGACTCACAATTAGCAATCGGCCGACGACTCCTGAAGCACCAAGATGCTACCCTGGAAGCAATGACCCTCGTTGTCCTAAACCCACTACACCAGCCCCACCGAAATGCTACTCAGGATCCACCGACCCTCGCTGCCCGAAACCAACTTCACCAGAGTCGCCTCGTTGCTACCCAGGTACCACTGACCCACGTTGTCCACAACCAACAACTCCTGAAGCACCTAGATGCTATCCCGGAAGCAATGACGCTCGTTGTCCTAAACCCACTACACCAGCCCCACCTAACTGCTTCCCGGGATCAAGTGATCCACGTTGTCCACAACCCACGACCCCCGACGTACCTAGATGTTATCCCGGAAGCCCTGACCCACGTTGTCCTAAACCGACTACCCCAGCACCACCAAGATGTTTCCCAGGGAGCAGATCCTCGCTGTCCTAAAGTAACAACACCACTACCAACGAAACCCTCTTGTTACCCAGGCTCACCAGATCCTAACTGTCCACAACCGTCGAGGCCAACAACACTTAATCCGCCTACTTATTTACCTCCTTCAACACCAGCACCACCAAAATGCTACCCTGGTAGCAATGACTCACGTTGTCCCAGACCAACAACTCCGGAAGCACCTAGATGTTACCCTGGAAGCAATAATCCTCGTTGCCCGAAACCAACTACATCAGAATCACCTCGCTGCTACCCAGGAAGTAGCGACCCACGTTGTCCACGACCAACAACTCCTGAAGCACCTCGATGCTACCCCGGAAGCAATGACGCTCGTTGTCCTAAACCCACTTCACCAGCCCCACCTAAATGCTTCCCAGGATCCTCTAATCCACGTTGCCCACAACCGACTACACCAGAGCCGCCAAGGTGTTTCCCAGGAAGTAACGATCCTCGCTGTCCTAAAGTAATAACACCAGTTCCTACAAAACCATCTTGTTACCCAGGCTCACTAGATCCTAACTGTCCACAACCGTCTAGACCAACAACTCTTAATCCACCTACTTACTTACCTCCTTCTTCAACAATTCCACCAAAATGTTTCCCAGGAAGTTCGGATCCTCGTTGTCCTCGACCCACAACTCCTGATACTCCCAGATGCTATCCAGGAAGTACAGATTCTCGTTGCCCAAAACCTTCAAGTCTTGGACCACCGAAATGTTTCCCAGGAAATTCGGATCCTCGTTGTCCACGACCAACAAGCCCGCAACCAAGCTCGACGCCCAGTTCATGTTACCCTGGGTCAAAGGATCCAAAATGTCCTCAACCATTCGCGCCCAGTAGCACAAATCCCCCAGTCACCTACTTACCACCATTTCCTGCAGATAACGAAATTAGCTCTGGAAGATTAGCCAATAAGAATTTCGATTATTACGATGACTCACAATTAGCAATCGATAATTTTGATTTCTCTCGTACAAAACCCAAAACGAGAAATGTAAGAGATTTAAGTAAATCATCCGCTGAAGAAAACCTTGGTCAGTTTTCCGGATTCGACAACCTGGCTATTATAGGTTCCGTATTGTTTGTTGCCTTGATCATAGCAGTTGCTATTGGTTCAATGAAATACAAACAAATGAAAAGGAAAAATGGTTTAAATGTACAATCAACTCAGAATCCTTGTTAG

Protein sequence:

>DPOGS213250-PA
MTNVDSVVIIVMQEPHEAAVMKIATLIALVQLSLATAVSDGTLTAAELNKELAGDNSLSPFFDEIIDDTAVNFVRSSRAVDSPLSPDINVQCSADFIDVTVEFSDVYDGIIYSKGYLNDPKCKYVSLGGSQSRYSFRVPLKGCGSRSLCNACGTIDNVLVFQADDFVQGPYDFARKVSCASTALEVSVGGVRKEQSHILKLKPFMVDMLDVVAVQGPAGGVECWMDIQKGVFPNTTPLEHSIKIGEYLTILIYLKDTRNQFNLKIHDCWAYDNEDYDNPNTNKIQLTDKEGCPKKRKFIDLFQKSTNTGKSGATLIAYSKVSAFRFPETDQVYLTCNVELCKSDCDSSCRDIIKPITTTKKPQIIPSCYPGSTDPRCPRPTTPEAPRCYPGSTDPRCPQPTTPAPPKCFPGSTDPRCPRPTTPEAPRCYPGSNDPRCPKPTTPESPRCYPGSSDSRCPRPTTPEAPRCYPGSTDTRCPQPTTPVPPKCFPGSTDPRCPRPTSPESPRCYPGSTDPRCPQPTTPEAPSCYPGSSDPRCLRPTTPEAPRCYPGSDDPRCPKPTTPAPPKCFPGSTDPRCPRPTTPEAPRCYPGSNDPRCPKPTTPESPRCYPGSSDPRCPRPTTPEAPRCYPGSSDPRCPKPTTPAPPRCFPGSTDPRCPQPTTPEAPRCYPGSSDPRCPKPTTPAPPRCFPGSTDPRCPQPTTPEAPRCYPGSNDPRCPKPTTPAPSKCFPGSSDPRCPGCGSRSLCNACGTIDNVLVFQADDFVQGPYDFARKVSCASTALEVSVGGVRKEQSHILKLKPFMVDMLDVVAVQGPAGGVECWMDIQKGVFPNTTPLEHSIKIGEYLTILIYLKDTRNQFNLKIHDCWAYDNEDYDNPNTNKIQLTDKEGCPKKRKFIDLFQKSTNTGKSGATLIAYSKVSAFRFPETDQVYLTCNVELCKSDCDSSCRDIIKPITTTKPQIIPSCYPGSTDPRCPRPTTTEAPRCYPGSTDSRCPQPTTPGSPRCYPGSNDPRCPKPTTPESPRCYPGSSDPRCPRPTTPEAPRCYPGSTDSRCPKPTTLPPPRCFPGSDDPRCPKVTTPVPTKPSCYPGSPDPNCPQPSRPTTLNPPTYLPPSTPGLPKCFPGSNDPRCPKPTTPAPPRCFPGSTDPRCPQPTTPEAPRCYPGSNDPRCPKPTTPAPPKCFPGSNDPRCPKLTTPESPRCYPGSSDPRCPRPTTPEAPRCYPGSSDPRCPKPTTPASPRCFPGSTDPRCPRPTTPEAPRCYPGSNDPRCPKPTTPAPSKCFPGSSDPRCPRPTTPETPRCYPGSTDSRCPQPTTPGSPRCYPGSNDPRCPKPTTPAPPKCFPGSSDPRCPRPTTTEAPRCYPGSTDSRCPQPTTPGSPRCYPGSNDPRCPKPTTPESPRCYPGSSDPRCPRPTTPEAPRCYPGSTDSRCPKPTTLPPPRCFPGSDDPRCPKVTTPVPTKPSCYPGSPDPNCPQPSRPTTLIPPTYLPPSTPAPPKCYPGSNDSRCPRPTTPEAPRCYPGSNDPRCPKPTTPAPPKCFPGSNDPRCPKSTTPESPRCYPGSSDPRCPRPTTPEAPRCYPGSSDPRCPKPTTPASPRCFPGSTDPRCPRPTTPEAPRCYPGSNDPRCPKPTTPAPSKCFPGSSDPRCPRPTTPETPRCYPGSTDSRCPQPTTPESPRCYPGSNDPRCPKPTTPTPPNCFPGSTDPRCPQPTTPDVPRCYPGNTDPRCPKSTTPAPPRCFPGSTDPRCPKVPTPLPTKPSCYPGSPDPNCPQPSRPTTLNPPTYLPPSTPAPPKCYPGSNDSRCPSRPTTPEEAPRCYPGSNNPRCPKPTTSESPRCYPGSSDPRCPRPTTPLKHLEICYPPEAMTLVVLNPLHQPHLNASQDPLIHVAHNRLHQSRQGSLDPNCPQPSRPTTLNPPTYLPPSSTIPPKCFPGSSDPRCPRPTTPDTPRCYPGSTDSRCPKPSSLGPPKCFPGNSDPRCPRPTSPQPSSTPSSCYPGSKDPKCPQPFAPSSTNPPVTYLPPFPADNEISSGRLANKNFDYYDDSQLAIGRRLLKHQDATLEAMTLVVLNPLHQPHRNATQDPPTLAARNQLHQSRLVATQVPLTHVVHNQQLLKHLDAIPEAMTLVVLNPLHQPHLTASRDQVIHVVHNPRPPTYLDVIPEALTHVVLNRLPQHHQDVSQGADPRCPKVTTPLPTKPSCYPGSPDPNCPQPSRPTTLNPPTYLPPSTPAPPKCYPGSNDSRCPRPTTPEAPRCYPGSNNPRCPKPTTSESPRCYPGSSDPRCPRPTTPEAPRCYPGSNDARCPKPTSPAPPKCFPGSSNPRCPQPTTPEPPRCFPGSNDPRCPKVITPVPTKPSCYPGSLDPNCPQPSRPTTLNPPTYLPPSSTIPPKCFPGSSDPRCPRPTTPDTPRCYPGSTDSRCPKPSSLGPPKCFPGNSDPRCPRPTSPQPSSTPSSCYPGSKDPKCPQPFAPSSTNPPVTYLPPFPADNEISSGRLANKNFDYYDDSQLAIDNFDFSRTKPKTRNVRDLSKSSAEENLGQFSGFDNLAIIGSVLFVALIIAVAIGSMKYKQMKRKNGLNVQSTQNPC-