Monarch geneset OGS2.0

DPOGS214400
TranscriptDPOGS214400-TA1947 bp
ProteinDPOGS214400-PA648 aa
Genomic positionDPSCF300069 - 168751-173013
RNAseq coverage139x (Rank: top 55%)
Annotation
HeliconiusHMEL0106082e-11239.74% 
BombyxBGIBMGA011247-TA2e-6462.50% 
DrosophilaMeics-PA1e-2933.76% 
EBI UniRef50UniRef50_B4GJE33e-3032.76%GL26275 n=7 Tax=pseudoobscura subgroup RepID=B4GJE3_DROPE
NCBI RefSeqXP_002019261.16e-3132.76%GL26275 [Drosophila persimilis]
NCBI nr blastpgi|3584136611e-3035.37%PREDICTED: zinc finger protein 37 homolog isoform 2 [Bos taurus]
NCBI nr blastxgi|1951567533e-3632.41%GL26275 [Drosophila persimilis]
Group
Gene OntologyGO:00036761.1e-10nucleic acid binding
GO:00082709.7e-05zinc ion binding
GO:00056229.7e-05intracellular
KEGG pathway 
InterPro domain[543-574] IPR0130871.1e-10Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL35019 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214400-TA
ATGTTACATAATACGGGAACATTTGATCAGAATGAAGTGCCAATTAAACAAAACAATTTATCGTCTGAAGATAGAATTATATCTAGTAACAATATAATGTCCAAAAGTGATCAATCAGGACAAATACACTTCAGTGTTCCGTACCAGGTGAACTTAGGATTGACTCCATATAACCATGGATTCGAAGTGAGTGCACAGCGTTTTCAAAACAATGGTCTCCCATTGAATTTGAATGTACAGAATAATTCAGTTTTAACCAAAAATTATCCACAAGTGTTTACAAAAGCGTATCACAAACAAATGGATATAGATTATAATTACTTAAATAGCTATAAGCAACCTAATTACGATGTAAATCATGTAAACACACCCAGCATGGCGCTAAGTTTACGTAAACACAATGAAGATACAAAAACAAGTGATATTAAACCACAGAGTGATATATTAACAAAAATTAGTCAAGAATCTAGTTATAAAATTGAAGTAGATTTGTCAAGAAAACCTGTAAATAATGTAAATACAGATAGTATCACAGATTTAAGTAATAAATTGCTTAGAGATGATTCCAGAAAAAGGTTAGCGAATACAGTCAAGATCATAGAAAATATTATATCACATCCGTCAAACTCGAGGAAGATTAAATTGGAACATAACATTAAAACCGAAAACTATGAAGACTTGTCGGAAACATCGGATACAGATTTTGTTATAAGTGAATCTGATTATAATGATGCAGATGATAAGAGAAGCATGAGCAAATCTTACAAACCGGGAATAAACCATCTCGACACACTATGCAGTACAGTGAAAACCGAGAATTGGTTGTGTGATGATGATAAGGCATCTAACGAAGACGTACAAGAGACTTCTACTAGAACGGTTGTGGTGCCGTCTATACTAAATATAGAGCAGATCAATCCATTCCTAAAGGACACACACGGGGTGCAGAGCGATGAGACCAAAAACTATGACATCATCGGAGCCGAGAGCAGCATAAATGTTGCGAAGGAAATTATTAAAAAAGGTGCCTCACTGATAGCGACGTACTTTGAATGCCCGCACTGCAAGCTCTTCTTCAACAATCCAAAGAGGTTCATAATACATACAAAGTGGCATACGTTCGGATACATAACCGAAAAGAAACTGAGGTCGCAAAAGGAGAAGATAGTTAAGCCTAGAAGCAGGAAGTACAAAGTTAGCGGAAGTGACGACAATACCGAGGAAGGTATCATACCTTGCACCGATTGCAAGAGAACGTTCAGCAGTACACCGAGTTTGATGAATCATAGACGGAAATATCACCCGACACGTCTAAGAGAATGCAAGATCTGTGGTAAGACGATGGTGGGCCTGGCAGCTCTGAGAGCGCATGTAACTACTCATACAACAGAATCTAGGTTCCAATGCGAAGATTGCCCAAAATGGTTCAAATATGCCCACTCGTTGGCCAAACATAGAGATACACATCTGGAGAAAACCGAGGAATGTCCTCAATGCCCAAAAAAGTTTGGCTCGACGGCCCTGCTTAATGTACACATGAAGACCCACGAGAGGGTGCTCCGGGGAGCTACATTCAGATGTACCTACTGTGGGAAGGGATTCTTCGAAAGTTACAGTCTTCAGGCTCACGAACGAACACACAGGAATGAAAGGCCGTTTGTGTGCGAGATATGTAACACAAGTTTCGGCACAAACAGCAGTCTCAAGCGGCATCTCAAAGTTTCTCACAGCACATCAAAGCCATTTGAATGCACAACCTGTCACCGATCATTTGTGTCGGAGAACATCAGGGACAGGCATTTCATCAGATACCACGGGGACCCAGAAGAGTTCAAATTCATGTGCAAATTGTGCCCGTGTAAATATTTAAATGCTAGGGAGTTGAGAAGGCATATATATAAAGTTCATCCCAAACCTAAGATTAAGGTGGAGGTGGGCAGTGACTGA

Protein sequence:

>DPOGS214400-PA
MLHNTGTFDQNEVPIKQNNLSSEDRIISSNNIMSKSDQSGQIHFSVPYQVNLGLTPYNHGFEVSAQRFQNNGLPLNLNVQNNSVLTKNYPQVFTKAYHKQMDIDYNYLNSYKQPNYDVNHVNTPSMALSLRKHNEDTKTSDIKPQSDILTKISQESSYKIEVDLSRKPVNNVNTDSITDLSNKLLRDDSRKRLANTVKIIENIISHPSNSRKIKLEHNIKTENYEDLSETSDTDFVISESDYNDADDKRSMSKSYKPGINHLDTLCSTVKTENWLCDDDKASNEDVQETSTRTVVVPSILNIEQINPFLKDTHGVQSDETKNYDIIGAESSINVAKEIIKKGASLIATYFECPHCKLFFNNPKRFIIHTKWHTFGYITEKKLRSQKEKIVKPRSRKYKVSGSDDNTEEGIIPCTDCKRTFSSTPSLMNHRRKYHPTRLRECKICGKTMVGLAALRAHVTTHTTESRFQCEDCPKWFKYAHSLAKHRDTHLEKTEECPQCPKKFGSTALLNVHMKTHERVLRGATFRCTYCGKGFFESYSLQAHERTHRNERPFVCEICNTSFGTNSSLKRHLKVSHSTSKPFECTTCHRSFVSENIRDRHFIRYHGDPEEFKFMCKLCPCKYLNARELRRHIYKVHPKPKIKVEVGSD-