Monarch geneset OGS2.0

DPOGS205798
Transcript: DPOGS205798-TA, 3402 bp
Protein: DPOGS205798-PA, 1133 aa
Genomic position: DPSCF300144 + 47269-55179
RNAseq coverage: 114x (Rank: top 59%)
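The lengths in the summary can be cross-checked with a little arithmetic. The sketch below is only a sanity check, not part of the database record. It assumes the listed transcript is the coding sequence only (it starts with ATG and ends with TAA in the sequence given further down) and that the genomic coordinates are inclusive on both ends; the helper names are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 3402
PROTEIN_AA = 1133
GENOMIC_START, GENOMIC_END = 47269, 55179


def codon_count(cds_bp: int) -> int:
    """Number of codons in a coding sequence; the length must be a multiple of 3."""
    if cds_bp % 3 != 0:
        raise ValueError(f"{cds_bp} bp is not a whole number of codons")
    return cds_bp // 3


def inclusive_span(start: int, end: int) -> int:
    """Length of an interval whose start and end are both counted (assumed convention)."""
    return end - start + 1


codons = codon_count(TRANSCRIPT_BP)
# One codon more than the protein length is what a terminal stop codon would account for.
has_stop_codon = codons == PROTEIN_AA + 1
locus_bp = inclusive_span(GENOMIC_START, GENOMIC_END)

print(codons, has_stop_codon, locus_bp)
```

Under these assumptions the transcript holds 1134 codons (1133 residues plus a stop), and the locus spans 7911 bp on the scaffold, so the gene region is longer than the transcript itself.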
Annotation
Heliconius: HMEL007955, 5e-108, 41.63%
Bombyx: BGIBMGA010364-TA, 5e-151, 38.21%
Drosophila: (none listed)
EBI UniRef50: UniRef50_UPI0002063822, 4e-29, 31.70%, UPI0002063822 related cluster n=1 Tax=unknown RepID=UPI0002063822
NCBI RefSeq: XP_001599329.1, 5e-20, 30.41%, PREDICTED: similar to breast and ovarian cancer susceptibility protein [Nasonia vitripennis]
NCBI nr blastp: gi|380028079, 3e-29, 32.46%, PREDICTED: uncharacterized protein LOC100867025 [Apis florea]
NCBI nr blastx: gi|340718230, 1e-31, 21.12%, PREDICTED: hypothetical protein LOC100649216 [Bombus terrestris]
Group
Gene Ontology: GO:0005622, 2.3e-17, intracellular
KEGG pathway: mmu:12189, 2e-21
  K10605 (BRCA1) maps -> Ubiquitin mediated proteolysis
InterPro domain: [911-1015] IPR001357, 2.3e-17, BRCT
Orthology group: MCL25731, Lepidoptera specific
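The InterPro entry places the BRCT domain at residues 911-1015 of the protein. A minimal sketch of how such an interval could be turned into a Python slice is shown below. It assumes the coordinates are 1-based and inclusive, which the record does not state, and the function names are hypothetical.

```python
PROTEIN_LENGTH = 1133           # aa, from the record
BRCT_START, BRCT_END = 911, 1015  # InterPro domain interval, from the record


def to_slice(start: int, end: int, seq_len: int) -> slice:
    """Convert a 1-based inclusive interval (assumed) to a 0-based half-open slice."""
    if not (1 <= start <= end <= seq_len):
        raise ValueError(f"interval {start}-{end} does not fit a sequence of length {seq_len}")
    return slice(start - 1, end)


def extract(seq: str, start: int, end: int) -> str:
    """Return the residues covered by a 1-based inclusive interval."""
    return seq[to_slice(start, end, len(seq))]


brct = to_slice(BRCT_START, BRCT_END, PROTEIN_LENGTH)
brct_length = brct.stop - brct.start

# Toy sequence for illustration only: positions 3-5 of "ABCDEFGHIJ".
toy = extract("ABCDEFGHIJ", 3, 5)
print(brct_length, toy)
```

Applied to the full protein sequence, `extract(protein, 911, 1015)` would return the 105 residues of the annotated domain, provided the coordinate assumption holds.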

Nucleotide sequence:

>DPOGS205798-TA
ATGCTACTGATAGATTTGGATCTAAACAAACTTTCGAAATTAGCATCACAACAAATTGACCACGTAACTTGTATAGAATGTTGTAAGTATTATACAGTACCCACAACAGCAGACTGTGGTCACTCTTTGTGTCATTCTTGCTGGCGCGGCCGTAGAGTCTGTGCTTCTTGTGGGAAATCCATCGAAAAGAAAAATCTAAAATTAAATATTCCACTCCAACATCTCACCGAACATATTTCAAGTCTATCTAAAACCTTGGAAGACCTCTTTAATATAAAGTTGGATGAGTATTTACTAGATTCACCAACAGAAAATCAAGATGAACCAAATAAGAATGTTAAAGAATGGTTGGCAAGTTCACAAAACCAGTTTTCTGCTCCTATTACAAATTCTACCCAATCAACTCAAGGACAGATGGAAGCAGAAAAAGTTACGAGCGACATACAAGTTCATTCTGTAAATAAGAGAATTTCAAATCATAAAGAAGTTATTCATATTCCGCTTCAACAAGATGACTGGGACAATATAGAAGAAATGCCAGAGATGGACAAAAATACTCATAAGAATCAAGAAAATGTTGTAGGTCCTATGGATATTGAACCTTTTGAATTTATGATGGATGATGAGTATAATGCCAACAATCTTCGACGTAGCTTAAGGAACAAAGATAACAAATCTCCTAACAGTGATCTTAAACAAGATGTCCAACAAGATAAAACTAACAGCCTAGAGACAAAAAAAACATCTGAAAAAAGTGGCCTCAAGGTAGCTAAAAATTGGAATAATGTTAAAAGAATGAAAAAAGAATTCAGTAAATTAAACAAAAAACATAGAAATAAATTGAATGTGTCTATAGAAATGTGTAAAAAAAAAATAAATAAAATTGTACCACCAATAACTCAGGAAAATTTATATACTATTGATGATAATACACCACAAATTGTAAACAATGAAAAAGATGATGAACTCATTAATAACTGTGATGTTAATAAAAAAAATGATTCTGAACAAGTTACAAGCAGTGATTCTGTTAAAAACAACATTACAAATAGTGTAACAGGTCATAAAATTATAAACGAAAACAATGAATTACCTCCGCCGAAAGTGGCATTTGTTAAGAAAAGTACTCTTATATACAAAGCGCAAGAAATTAATGATATAAAAATTCCAACAGACAGACATTCAACTGGGGTGGCTGTGAACAATAATGATATAGAGATCACTATAAAAATAGGAAATACTCTTACAAACATTTGTATAAAGAAAAAGGACAATGATGTACAATTAAAAATTAATTCAGACAGAGAAGTTCAGACATCATTAGAAAATGATAACAAAAATAAAAACGATATATCTAGTTCTAAAATTACAATTAACAATATGGAAAATACTGTCCATTGTGATAACATTCCCAATAAAAATATGGTTTCTAAGATAAATAGTGCGAATAAAACAGTCTCTAAGAAGAACACAGCATCAGCTGACACTAGCACCGCTCAATTTGAAATAACAAAAAGCATCGGGGAAGAAATGATGAAATCTTTAAACACTAACCAAGAAAGCATAAAGAAAACACCATGCGGAAATGTACTACAAGAAAATAGCAGTCAGGTGGTATTAAACAAAATAACTGAAATTCAAGAAGAGGCTGATATGGATATTTTTGACACTGATTCAATCAAAGAAGCAAATGTCGCCCCGATGAAATCAGCAAAAAATACCCCATCAGCTATCCTACCTTCTTTCAGGACATCGAAAACTAAGACTCCGAAATCCGATAAAAGAAATAGGGAAACAGATTTTGGTGAAAATCTGCCGAGTAACAAAAAAAGAAAGATCTCACCATACAAAGAGATCCAAGCGACCAATGAAAAGAGAGACAAAGATCGTTCTATGGAACATGATCATGTCAATGATGAGGCTTATTGCCAAATGTTAGACAACATGAATACTATGGAAGATGTTAAAAATAATATGAAAAAAATAAACAAATCTCAGGC
CATTAAACCGTTTGATAAATCACCATCACAAATAAGTGTCATAAATAATTCCTTTGACAATAAAACATCAAAAGACAAACACATTGAAAAATACAGTGAAAATGTCTTTTCTATGCTGGATAATGAGTCTCAAATACTGAAAACATTAAACTATAAGACACAACAAAGTCAATGCAACAAAGATTCTGATGGAGAGAAGGTAATAAATAACCAGAAAGGGAATGCAGATTTAGATGTCATGGAGCTTTGCACTCCTGTGGCGGATGAAGATTCAGACAAGAGTGTAGTCGAAGATACACCACAAAAGACGCCATCCTTCTCTAAAATACGAAACAAAAACGAATCAACCGTCAACAGCAACAATGTTACGATGAAAGAAAATATAAAACTTAAAGTTCAAAACAGACAAACAGACATTGAGGCAGATATCATAAGTTTATCTGATACAGTCGGAGAGTCTACAAAAAATATTACAGTATTAGAAACATTGCCCCGACCAACATTGGAGACTCCTATCACAATAACAAAGTTTGTGGACCAAATAAAACATAAATCGACACCCGTAGCAAGAAAATCACTTAAATTTAACAATGAAAATAGTGAAGCTGACGTCACTATGTGTCCAAGCTCGTTTGTCCTCGCAAAAACCACACAAGAGAAGGAATTCTTAAGCAAAGCATTCGAACAAACCATAGATTCAGAATGTGTGAGACACAACGAGAGGTGTTTAAAGTATTGTATTGCTGGATCATGTCTGACCGCATCTGAACAGAGCAATGTGAAAATTTTATGCTCCAAAAGAAATTGGACATATGTCGAAAAATACACTAAAGAATTAACACATCTGGTTGTTGGTGTGGACGAGGAGAATAAAGCACAACGATCCGTTAAATTCATGTGTGCTATAGCCGGTGGGAAATGGATTATATCATATGAATGGATTGAAAAATGTTTACAAGTAAATGGCGTGGTAGATGAGGAACAGTTCGAAGCGTTAGATGCTACGGGCGAGCCTGGTCCGAGACGCTCGCGTTTGGCTAAACAAAAACTATTCACCGGCATCAGCTTCTACTGTATGCCGCCGTTCAGAGTACTAGATGTCGATACTTTGAAGGACATCCTTCAGTGTTCAGGCGGCCGCGTTGTTGTAGAAGCTAGAGATGTCCGAGCGTCGAGCACGCCCTCGCTGTTGTTAGCCGAACCCGAGCACACACAGGAGGACCGCTTCATATATCTTGCTATGGAGCTGAGCGTGGTGCCAATAAACTACGAATGGGTGCTCAATTGTCTTGGAAGCTACTCGTTGGAATCCATTTACGAACTATTATTATGTCCGGCCTCTTTGTTGCCGCCGGTCGTTGCAAAATGGCCGCCAGAACTTATATCACATGATGTTGAATAA

Protein sequence:

>DPOGS205798-PA
MLLIDLDLNKLSKLASQQIDHVTCIECCKYYTVPTTADCGHSLCHSCWRGRRVCASCGKSIEKKNLKLNIPLQHLTEHISSLSKTLEDLFNIKLDEYLLDSPTENQDEPNKNVKEWLASSQNQFSAPITNSTQSTQGQMEAEKVTSDIQVHSVNKRISNHKEVIHIPLQQDDWDNIEEMPEMDKNTHKNQENVVGPMDIEPFEFMMDDEYNANNLRRSLRNKDNKSPNSDLKQDVQQDKTNSLETKKTSEKSGLKVAKNWNNVKRMKKEFSKLNKKHRNKLNVSIEMCKKKINKIVPPITQENLYTIDDNTPQIVNNEKDDELINNCDVNKKNDSEQVTSSDSVKNNITNSVTGHKIINENNELPPPKVAFVKKSTLIYKAQEINDIKIPTDRHSTGVAVNNNDIEITIKIGNTLTNICIKKKDNDVQLKINSDREVQTSLENDNKNKNDISSSKITINNMENTVHCDNIPNKNMVSKINSANKTVSKKNTASADTSTAQFEITKSIGEEMMKSLNTNQESIKKTPCGNVLQENSSQVVLNKITEIQEEADMDIFDTDSIKEANVAPMKSAKNTPSAILPSFRTSKTKTPKSDKRNRETDFGENLPSNKKRKISPYKEIQATNEKRDKDRSMEHDHVNDEAYCQMLDNMNTMEDVKNNMKKINKSQAIKPFDKSPSQISVINNSFDNKTSKDKHIEKYSENVFSMLDNESQILKTLNYKTQQSQCNKDSDGEKVINNQKGNADLDVMELCTPVADEDSDKSVVEDTPQKTPSFSKIRNKNESTVNSNNVTMKENIKLKVQNRQTDIEADIISLSDTVGESTKNITVLETLPRPTLETPITITKFVDQIKHKSTPVARKSLKFNNENSEADVTMCPSSFVLAKTTQEKEFLSKAFEQTIDSECVRHNERCLKYCIAGSCLTASEQSNVKILCSKRNWTYVEKYTKELTHLVVGVDEENKAQRSVKFMCAIAGGKWIISYEWIEKCLQVNGVVDEEQFEALDATGEPGPRRSRLAKQKLFTGISFYCMPPFRVLDVDTLKDILQCSGGRVVVEARDVRASSTPSLLLAEPEHTQEDRFIYLAMELSVVPINYEWVLNCLGSYSLESIYELLLCPASLLPPVVAKWPPELISHDVE-
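The protein record can be related back to the nucleotide record by translating codon by codon. The following is a minimal sketch using the standard genetic code; it is not necessarily how the gene set was produced. The trailing "-" in the protein sequence appears to mark the stop codon, so the hypothetical `translate` helper uses that symbol by default.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... (bases in T, C, A, G order).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight codons and last five codons of DPOGS205798-TA, copied from the record.
print(translate("ATGCTACTGATAGATTTGGATCTA"))
print(translate("CATGATGTTGAATAA"))
```

The two fragments translate to the start (MLLIDLDL) and the end (HDVE-) of DPOGS205798-PA. Passing the complete nucleotide sequence would allow the same comparison over all 1133 residues.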