Monarch geneset OGS2.0

DPOGS203557
TranscriptDPOGS203557-TA2412 bp
ProteinDPOGS203557-PA530 aa
Genomic positionDPSCF300055 + 574760-577171
RNAseq coverage4x (Rank: top 89%)
Annotation
HeliconiusHMEL0121621e-1420.26% 
BombyxBGIBMGA006181-TA2e-0938.46% 
Drosophila% 
EBI UniRef50UniRef50_D7ELI75e-6832.02%Putative uncharacterized protein n=2 Tax=Endopterygota RepID=D7ELI7_TRICA
NCBI RefSeqXP_001942753.15e-6031.07%PREDICTED: similar to Uncharacterized protein ZK1236.4 [Acyrthosiphon pisum]
NCBI nr blastpgi|2700157432e-6732.02%hypothetical protein TcasGA2_TC004344 [Tribolium castaneum]
NCBI nr blastxgi|2700170422e-14636.27%hypothetical protein TcasGA2_TC004227 [Tribolium castaneum]
Group
KEGG pathwayhmg:1002024425e-07 
 K05658 (ABCB1)maps-> ABC transporters
InterPro domain[6-207] IPR0051357.6e-07Endonuclease/exonuclease/phosphatase
Orthology groupMCL10176 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203557-TA
ATGATTGTTGCAACCGAAACATGGCTGGATGACACTGTTGCGGACGGGGAATTGTTCACGGATAAATACACCATCTACAGAAGAGACCGTCAGTCAACCTCCTCTCAACGCAAGACTGGTGGTGGAGTACTTATCGCAGTATCTAAATCTATAGAATCTAGACGAATTGAACATTTAGAAAGTAACGGTGAAGATCTATGGATTTCTGTAAAAGTATCCGAAAATGGAGTTTCTACGAACGTTTTATTTTGTGCAGTGTATATTCCTCCGCCTGTCTCGTTGGAAAGTTTAAACCTCATTTTGGACAATATAAGTGTTGTAATGCAATCTAACCCTGGTAAAGTTGTAGTTTTGGGGGATTTTAATTTGGGTTTTTTAAATTGGCGGCTGGACGAAATCGAGGATCTGTTAAAACCCGACCATACCGATAATATTTTGGGATTCCCTTTTGCCGATTTTTTGTCACTCAATTCACTAACACAATACAACAAAATTAAGAATCACAATCAACGTATACTAGATTTGGTCATGAATAATTTTAACAATTTAACAGTAATTGAAGCTTGCGAATTATTAAGTAACGTTGATCCTCACCATCCAGCACTAGAAATATCTTTGCGAATTAATGCCGATAAATTTTTATACAATAAAATATTTGCTACATATTTGTTTAGGAAGGCAGACTACGACGCGGTGAACAAATGTTTAGGAGAGTTTGATTGGTACTCCGAATTAGGGAGTTGCGGGAATGTCGATGCTATGGTAGAAAAATTGTATAATATCTTGTACGCTACAATCGATTCTCATGTTCCAAAAATCAAGCCAAAAGCATTTAAGTATCCTGTCTGGTTCACATCTTCCCTGATAAAATTAATTAACGAAAAGGAGAAAGTTAGGAAACTGTGCAAACGTTATAAGAATCCTAGAGATATATTAGAGTTAGCAAGAAGCCGTAGGCGCGTCGAGAATCTACTTGAATCATGTTACAGCCGTTACATTTCCGGAGTCGAGGATTCAATCTTTCGCGATCCCTTAAAATTTTGGCGTTTTGTTAAAAAACGGCGTGGAGCTAAATCCGAGGTCCCATCTCAAATGTCACTAGGTAACACCACAGCCCATTCTGGTCAAGCGATTTGTAACCTTTTCGCACAGAACTTTGCGTCCTCATATTCTTCTTTGCAGCCTTGTGTAGAATTCGTAGAGGAAAGTCCGGACTTTTATTCTCAAATGTCTCTCGCTCACGTCACACTAAATGAACGTACTATATCGAAGGCATTACGTTTAATAAATCCAAGCAAGTCAGCTGGTCCTGATGGCATTCCTCCTCTTTTTTTCAGAAAAACGTGTAAATTACTCGCACTCCCATTAAAAATTATTTATAACACATCTCTTCAGACTTCAACTTTTCCTACGAAGTGGAAGGAAGCTAATATTTTACCAATTCATAAAAAGAAATCAAAGAGCGATGTAAGAAATTACCGACCGATTTCAATGTTAAAGGTATTATCCAAAGCCTTTGAAAGTATAATTACCCCTATTTTGTCCCAATATATCAAAACCATGATTACAAGTGATCAGCATGGATTTTGCTAGCGTAAGTCAACCACTACAAATCTCACAACTTACATACATTACATTTCTGGGTGCATAGATAATAAACAACAGGTCGACTCGATCTATACCGACTTCAGTAGCGCGTTTGACAAAGTTAATCACAAAATCTTAATTTACAAGCTTAGGTTATACGGCATTCATGACCCCCTGCTATCTTGGTTCCGATCGTACTTATCTGATCGAGTACAGAGAGTTACACTCGGTGGTTACAAATCTCACGAGTTTATTGCGACCTCTGGTGTTCCACAGGGATCACACCTAGGCCCAATATTATTTATCATCTTTATCAATGACATTGCACACTGTATCAAGAACAGCAAATGTTCTATCTTTGCTGACGATTTAAAGATTTACCGCTCTATAACATCCATGAATGATTGCAAATTGCTCCAAGACGATCTAAACTCCATACAAAAATGGTGTGATTTGAATGGAATGGAACTAAATGTCGATAAGTGCCAGTTTATTAAGTTTACGCGAAACAAAAACATAATAAGACATCACTATAGCTTACAGGGCAAACCACTCGGGGAAGTTTCGGTTGTTCGTGATTTAGGTGTTATGATCGACGCCAAACTTAAATTTAACACTCACATAAATCATATTGCCACGAAGGCCTGGAAGAACTTCGGTTTTGTCAGACGTAACTGCAATGATTTTAAGAGACCGGCAACTATCATAACTTTGTACAATTCGTTTATACGTAGCGCTTTGGAATATGCGGTTGTAGTATGGAATCCGCATTATAAGATTTACATTGAAAGGTTGGAACGCATACAACACAAATTTACAAAATGA

Protein sequence:

>DPOGS203557-PA
MIVATETWLDDTVADGELFTDKYTIYRRDRQSTSSQRKTGGGVLIAVSKSIESRRIEHLESNGEDLWISVKVSENGVSTNVLFCAVYIPPPVSLESLNLILDNISVVMQSNPGKVVVLGDFNLGFLNWRLDEIEDLLKPDHTDNILGFPFADFLSLNSLTQYNKIKNHNQRILDLVMNNFNNLTVIEACELLSNVDPHHPALEISLRINADKFLYNKIFATYLFRKADYDAVNKCLGEFDWYSELGSCGNVDAMVEKLYNILYATIDSHVPKIKPKAFKYPVWFTSSLIKLINEKEKVRKLCKRYKNPRDILELARSRRRVENLLESCYSRYISGVEDSIFRDPLKFWRFVKKRRGAKSEVPSQMSLGNTTAHSGQAICNLFAQNFASSYSSLQPCVEFVEESPDFYSQMSLAHVTLNERTISKALRLINPSKSAGPDGIPPLFFRKTCKLLALPLKIIYNTSLQTSTFPTKWKEANILPIHKKKSKSDVRNYRPISMLKVLSKAFESIITPILSQYIKTMITSDQHGFC-