Monarch geneset OGS2.0

DPOGS209733
Transcript: DPOGS209733-TA, 2190 bp
Protein: DPOGS209733-PA, 729 aa
Genomic position: DPSCF300105 + 291489-297882
RNAseq coverage: 105x (Rank: top 60%)
Annotation
Heliconius: HMEL011356, 3e-145, 56.89%
Bombyx: BGIBMGA008948-TA, 6e-162, 55.08%
Drosophila: CG4424-PA, 2e-25, 29.35%
EBI UniRef50: UniRef50_B4KE32, 7e-25, 31.46%, GI10316 n=1 Tax=Drosophila mojavensis RepID=B4KE32_DROMO
NCBI RefSeq: XP_001953174.1, 4e-29, 32.41%, GF17340 [Drosophila ananassae]
NCBI nr blastp: gi|194741394, 8e-28, 32.41%, GF17340 [Drosophila ananassae]
NCBI nr blastx: gi|194741394, 2e-32, 32.07%, GF17340 [Drosophila ananassae]
Group:
Gene Ontology: GO:0005634, 6.4e-15, nucleus
               GO:0008270, 6.4e-15, zinc ion binding
               GO:0003676, 1.4e-09, nucleic acid binding
KEGG pathway:
InterPro domain: [446-519] IPR012934, 6.4e-15, Zinc finger, AD-type
                 [692-719] IPR013087, 1.4e-09, Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL25556, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209733-TA
ATGGAGGAAAAAAGTTTTGTTTGGACATCTGATTTAACGAAAAGATTTTTTCAATTGCGTTTTGATAATGAATGGCTGTTCCGAAAAAAAAAGCAACCTTGGAGAGAATTCTATAAAATTTTACTAAAAAGTGGATTTCCAGAAGAGATGACACTCAATCATGTACGCAAGAAGTGGTCATATACATATGATTCATACAGAATAGCAAAGAAAACAAACAACAAACAGTGGAAATATTTCAAAATATTTGATAAACATTTTGGGAAAACTCAAGTTTTAGATAAATATGAATCATGGACCGATGAATGGCGACATAAATTAATAATATGCATATCTGAGGCCAAAGAAGTTAAACTTGATTTTCAAAATATGTGGAGGACAGTTGAAAATGCATTACGATGCCAGGATCTTCCGCTAGACTGTTGCATACAAGATATTAAAGGGTTATGGCATTACATTAGAATGACATTTAATAGGAAGTACAGATTGGTAATGAGGAATAGTATAGATTCTGAGGATTGGCCACTTTATGACCCCATGTTGGAGTATCACACTAAATATGAGCCAGAATATCTGGAACGCCTCAGCAGTATGTCTGCTGGTGGTATGGCTGCTATAGAGTTCCGCTTAAAACATAGACCTAGAGAGAAGAAAAAAAAAGATGAAGCAGATGATGAATTCCAATGGTCCAGAGATATCACGGAATCATTCATTCAGATTAGAATGCAGAATGATTGGCTTTTTAGGGACAGGAAATGGGCGTGGAGTAACCTGCGTCAGATTATGATAGAAGAGTACGGTTTCCCACATTGCCTGTCTAGCAGAGACCTCAGCAGGAAGTGGGCTGCAATATATGCTGAGTACCAAAAAGCTAAAGCGACAAACAATATCTCATGGATGTATTATTCTCTTTTTGAAGTTTATTTCGGAGAAAGCAGTATGAGTCTCAACCCTTTGCTTGGCTGGCAAGAAGAGTGGGTGATTAATTTAATAAGTACCAGAACAGAATTAGAACAATTGTTTAAAATGTGGGAAAAGAAAAAGGAGACACCGTGGCGAGAAGTGGAGAAAAAACTCAGGAAAATGGGAATTCCTTTGGATCATAGTCTTCTAGAAATAGAGGAAATTTGGCGGCACTTATTGAAGACTTTTAAGTGGAAGCAGAAATTCGCTAGCAAAGGTATACTCAACGAGCAGTGGCCGTACTACGAACACGTGTCCAGATATGTCGACCAGCACGAAGCAAAGGAGGCTAATGACGGAGATTTCGAAGACGACGTGAAGCTGTACGAGCTGAAGAAGATCGCCATGGAACCGAAGCATGAAGTGACCAATGTGTGCAGATCGTGCTCGAGCGACGATGGCTGTGTGAAAATATTTGAGGAAACAGACGACGAAGGTCTCGATGTGGCGTATAAGCTGAAAGTCATCGGTGGCATAGAGATACAAAGATCAGATACCTTACCCACCCAAATATGTCTTCAGTGTCTACAAGAGTTGGAGAACGCGTTCAAGTTCAGACGTCAGTGTCAAGAGGTGGACAAAAATCTCAGAAGCAGCTCCTCCTTCATCAAAGTGGAATTACAACTAGACGATAAACATCATACGAACGAAATCTGCGATGGAGAGAGACAGAACTATGAAATAGAGATGGATAGAGACGGCGTCACCATGGCAACGAAAAAAAAAACATCCCCGCAAATGAGACCCGCGAGGAAAGTTATAAGGAGGAAGAAGGTCCGCAAGTCCGAATACGAATATCTAAAGGTGTGCGAAGTGTGCGGGAAACACACCAGAAACCTCAAGGCGCACATGGACGTACACTCGAAAGACAAATGTTACTCGTGTGAAATATGCGAGAAGAAATTTAAATTCAAAAGCGGGTTGATAGTCCACAAAGCCACCCACAATCCGACACCCAAAAAGACATGCGAAGTCTGCGGGAAGAGCTTCCATATATTGTCTCAATACAGAAGACATTACGCCTACCACGCGAACGA
AAGGAAATACGGTTGTGAGACATGCGGGAAAAGATTCAATTCTTTAGACATTTTAAAAGTCCACGCCAGAATCCACACGGACGAGAGACCGTTTAGCTGTTCCGAATGTGGTAAAACTTTCAGAACAGCCGGATGTGTGGGCAGACACAAGAGGATAGTCCACAGGAATGTAGGACTTCAAAAAATTTAA

Protein sequence:

>DPOGS209733-PA
MEEKSFVWTSDLTKRFFQLRFDNEWLFRKKKQPWREFYKILLKSGFPEEMTLNHVRKKWSYTYDSYRIAKKTNNKQWKYFKIFDKHFGKTQVLDKYESWTDEWRHKLIICISEAKEVKLDFQNMWRTVENALRCQDLPLDCCIQDIKGLWHYIRMTFNRKYRLVMRNSIDSEDWPLYDPMLEYHTKYEPEYLERLSSMSAGGMAAIEFRLKHRPREKKKKDEADDEFQWSRDITESFIQIRMQNDWLFRDRKWAWSNLRQIMIEEYGFPHCLSSRDLSRKWAAIYAEYQKAKATNNISWMYYSLFEVYFGESSMSLNPLLGWQEEWVINLISTRTELEQLFKMWEKKKETPWREVEKKLRKMGIPLDHSLLEIEEIWRHLLKTFKWKQKFASKGILNEQWPYYEHVSRYVDQHEAKEANDGDFEDDVKLYELKKIAMEPKHEVTNVCRSCSSDDGCVKIFEETDDEGLDVAYKLKVIGGIEIQRSDTLPTQICLQCLQELENAFKFRRQCQEVDKNLRSSSSFIKVELQLDDKHHTNEICDGERQNYEIEMDRDGVTMATKKKTSPQMRPARKVIRRKKVRKSEYEYLKVCEVCGKHTRNLKAHMDVHSKDKCYSCEICEKKFKFKSGLIVHKATHNPTPKKTCEVCGKSFHILSQYRRHYAYHANERKYGCETCGKRFNSLDILKVHARIHTDERPFSCSECGKTFRTAGCVGRHKRIVHRNVGLQKI-
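
Consistency check (illustrative): the record's lengths and sequences can be cross-checked with a short script. The sketch below is a minimal example and not part of the database. It assumes the transcript is coding sequence only, that it uses the standard genetic code, that the trailing "-" in the protein sequence marks the stop codon, and that the genomic coordinates are 1-based and inclusive. The names `translate` and `check_record` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code in TCAG order; "-" marks a stop codon,
# matching how this record displays the end of the protein (assumption).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY--CC-W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(nucleotides):
    """Translate a DNA string in frame 0, ignoring whitespace and line breaks."""
    seq = "".join(nucleotides.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def check_record(transcript_bp, protein_aa, start, end):
    """Return (coding_ok, genomic_span) for one gene record."""
    # A complete coding sequence has one codon per residue plus one stop codon.
    coding_ok = transcript_bp == 3 * (protein_aa + 1)
    # Coordinates are assumed to be 1-based and inclusive.
    genomic_span = end - start + 1
    return coding_ok, genomic_span


# First and last codons of DPOGS209733-TA, copied from the record.
print(translate("ATGGAGGAAAAAAGTTTT"))        # MEEKSF
print(translate("AATGTAGGACTTCAAAAAATTTAA"))  # NVGLQKI-
print(check_record(2190, 729, 291489, 297882))  # (True, 6394)
```

Under these assumptions, 2190 bp equals 3 x (729 + 1), and the genomic span of 6394 bp exceeds the transcript length, which would be consistent with the gene containing introns. To check the full sequence, join the wrapped FASTA lines before passing them to `translate`; the function already discards whitespace and line breaks.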