Monarch geneset OGS2.0

DPOGS208099
TranscriptDPOGS208099-TA2316 bp
ProteinDPOGS208099-PA771 aa
Genomic positionDPSCF300395 - 17421-22618
RNAseq coverage630x (Rank: top 20%)
Annotation
HeliconiusHMEL0160845e-6432.02% 
BombyxBGIBMGA001817-TA4e-4431.82% 
DrosophilaCG1647-PA6e-1345.78% 
EBI UniRef50UniRef50_B4M1846e-1424.57%GJ23048 n=3 Tax=Drosophila RepID=B4M184_DROVI
NCBI RefSeqXP_002053975.11e-1424.57%GJ23048 [Drosophila virilis]
NCBI nr blastpgi|1953906382e-1324.57%GJ23048 [Drosophila virilis]
NCBI nr blastxgi|1950541227e-1823.25%GH22430 [Drosophila grimshawi]
Group
Gene OntologyGO:00056347.1e-08nucleus
GO:00082707.1e-08zinc ion binding
KEGG pathway 
InterPro domain[20-84] IPR0129347.1e-08Zinc finger, AD-type
Orthology groupMCL26681 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208099-TA
ATGGATGGCTATCTGAATTCGTCGCTGGAGCAGCAGACCATATACGACATATACCGAGCATGTCGGCTGTGCGGAGCTGGCGCCGGGTATAAGATGCCCATCATGCAGAACGTCGTTCACTTGGACAGTTCTGAGGTTCACCAAGAGGACAAAATGCCCCCGTTGATATGCGAGCTGTGTGTGGATAAAGTGAATGATTTCTACGAGTTTCTAGAGATGTGCCGTCAGACTAACAAACGAACCCGGCTACGTCTGGGGCTGCCTCCTCAGACGATGCCACGAGGGGCACCGGATGCCGGGGACTGTATATTAGGTGTGACCGAACCAGTCTATTTGAATGAAGACTCCAATGAATCACTTCTGAAGCCCCGAGAGAGGAACTTCAAGGGCAGGTTCAAGAAGGAACACGAATCTAACAAGAAAGGATCCAGGGGTAAGGACGCCCGCATCGACCTGACACAGGCCACTCCGCCACGACACAGAGCCAACAAGAGAGTGCTGGACGAAGACGTGTCTCTCAGTATGTTACAGGTGGGTTCAACAAAGAGGTCGGTGAAGATGCAGAGGTCCATACTGAGGAACGATGAAATTAAGGTGGAAGAGGACAGCTCCCCGTCCAGGTCCAAGAGAGGTCAGGACTCGAAGGAGAAACAAGCCACTAAGAGAGTTAAGATAGTTGACAGCAAGGCGGCCCGGAAAGACAAGGTGGACTTCAGGTGCAAGACCTGCAAGGCTCACTTCACGACGAGCCGCTCCCTGGAAAGGCACACGCGCACTCACACGCTGAAACTCAAGAAGGCGAGCAAGGCGGTCAGCTGCAGCAAGTGTAAGAGAGACTTCCCGAGATTCTCGCCGTACCTGTTTGTGTGGTGCAGCACCGCCGCCCGCTACAACTGTTCCCTGTGCGGCCGGGCCATCAACAACGCGAGTAACCTGGCGGCGCACGAGAGAGCCTGTCGCAACAAGAAAGGCACGAAGTTCAAAGTGAGTCCTGTTGTCATGAAGCAGCTGAGACCGGTCCGAATACAGCTCCAGAGGTGTGACTCCCTGCTGGAGACGCGGAGAGGGGAGAGCTTTGATGTGTCCTCGGTACAAGAGAACTTTGGTCTGGACAAGAACTGTATCTACCCGTACTTGAGGAGGAGCATCAAGACTGAACCCGCTTACATGGTGAACATACATGAGGACATCGATGACTTTGATTTGAACCAGGAGTATGTGCACTGGGACTCGGACAGCACAGCATCAGACGCGGAGACGAGAGACACGGTCGGCTCCTTGACCGCTCTCACGTTAAAAACTATATTCTCAGACAAATTGATCGGGAAAGTGCCCAAGAGGAGACGAAGGCTGAGGAAGACGTCGGACTCGAACTTCAGTTCAGAGAAGCTTGGCATTGATAGTATCATAAACAGTTTGGACAAGAGAGACGATGACTCCCTGTTCGGTGATGAGAAGGTGGCTCCTGTCAGCAACGATGACTTCGACTCATTACTGGTCGAGCAGGAGAGTGATGGACCCAACGAACTGAACAGATTACGAGATGACTTACATGTTAGAGGCGACGACTCGAGTAACGAGGGTGTAGGGAATAATATAAGCAATGACTCGATAGGTACAGATACGAAAAGCAAAGACAACACTAATCATACAGGCCCGGGAAGTGAGCTGAGTCACCATCAAGCTACTGAGGACCTGAGGCGATGTGATGTGGACAGAGACAGTAGGAATGAAGCTACTACCGACAGCTCTGTGGATTGTACAGATAATAATACAGTGTGTGGCGGTAATGTTAGTGATAGGCTCGTCAATGATAATCACACGCGGACTGATGATAATGATACGACAGCTCATGATAATGATACGAGATATAATGACAAACTAGTTAATGATAATGACACGCGAAGTAAGGACACAGGAGATAATGATACGCTACATAATGATAATGATATGTCATTGAATGGTAACGTCACACGAGATGATCACACAAGGGATAATGGTGCATCAGTTAATGATAATGACACGCCAGTTAACGATAATGATACGATAGAAGATGAAAATGCAGACATTACTATTGAAAAGGTTGAGGAAAAAGTCCTTAGTAACAATGAGGTTAGTATCAATGATGATGAAGAAGAATATAAAGACGAACAAACTGATTTAGATGATGGAGAGGTGACCGAAGACATAGATGACCAGAAACTGATGGAAGCTCTGGATGAACAGCTGGGCGAAGAGGCCGGTGAGAAGGATGCCAGCACCGAGGACAAGGCTAATGCAGACCCGGTGAGTATATCCAGCGGTGAAGTGGACTGA

Protein sequence:

>DPOGS208099-PA
MDGYLNSSLEQQTIYDIYRACRLCGAGAGYKMPIMQNVVHLDSSEVHQEDKMPPLICELCVDKVNDFYEFLEMCRQTNKRTRLRLGLPPQTMPRGAPDAGDCILGVTEPVYLNEDSNESLLKPRERNFKGRFKKEHESNKKGSRGKDARIDLTQATPPRHRANKRVLDEDVSLSMLQVGSTKRSVKMQRSILRNDEIKVEEDSSPSRSKRGQDSKEKQATKRVKIVDSKAARKDKVDFRCKTCKAHFTTSRSLERHTRTHTLKLKKASKAVSCSKCKRDFPRFSPYLFVWCSTAARYNCSLCGRAINNASNLAAHERACRNKKGTKFKVSPVVMKQLRPVRIQLQRCDSLLETRRGESFDVSSVQENFGLDKNCIYPYLRRSIKTEPAYMVNIHEDIDDFDLNQEYVHWDSDSTASDAETRDTVGSLTALTLKTIFSDKLIGKVPKRRRRLRKTSDSNFSSEKLGIDSIINSLDKRDDDSLFGDEKVAPVSNDDFDSLLVEQESDGPNELNRLRDDLHVRGDDSSNEGVGNNISNDSIGTDTKSKDNTNHTGPGSELSHHQATEDLRRCDVDRDSRNEATTDSSVDCTDNNTVCGGNVSDRLVNDNHTRTDDNDTTAHDNDTRYNDKLVNDNDTRSKDTGDNDTLHNDNDMSLNGNVTRDDHTRDNGASVNDNDTPVNDNDTIEDENADITIEKVEEKVLSNNEVSINDDEEEYKDEQTDLDDGEVTEDIDDQKLMEALDEQLGEEAGEKDASTEDKANADPVSISSGEVD-