Monarch geneset OGS2.0

DPOGS203784
Transcript          DPOGS203784-TA    1278 bp
Protein             DPOGS203784-PA    425 aa
Genomic position    DPSCF300010 + 995273-996951
RNAseq coverage     194x (Rank: top 48%)
Annotation
Heliconius          HMEL008535          3e-179    73.78%
Bombyx              BGIBMGA006563-TA    1e-72     58.10%
Drosophila          CG15073-PA          2e-18     23.28%
EBI UniRef50        UniRef50_Q7PH83     6e-20     26.72%    AGAP003594-PA n=1 Tax=Anopheles gambiae RepID=Q7PH83_ANOGA
NCBI RefSeq         XP_313351.2         1e-20     26.72%    AGAP003594-PA [Anopheles gambiae str. PEST]
NCBI nr blastp      gi|158291815        2e-19     26.72%    AGAP003594-PA [Anopheles gambiae str. PEST]
NCBI nr blastx      gi|158291815        4e-24     26.97%    AGAP003594-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology       GO:0005634    5.6e-10    nucleus
                    GO:0008270    5.6e-10    zinc ion binding
KEGG pathway
InterPro domain     [17-82]    IPR012934    5.6e-10    Zinc finger, AD-type
Orthology group     MCL25255    Lepidoptera specific

Nucleotide sequence:

>DPOGS203784-TA
ATGAACGACTTCGCCACTTTAGAGAAAACTATAGTGCGTGAACTGTCGTGTCGTTTATGTCTATGTACGGATGTTTCAAAATTAAAACCTTTAAATGCTGAAGTGAAACGTAAAATTAAAAGACTATTTGATGTAAATATTCGAACAGACGACAATCTTCCAAAGGCAATATGTCACGAATGCTTACAACAAGTGGCCGCATTACATCTCTACGCGGTAAAAGTGGAAAAGACACAAAAATTTTTGGATTTTCACAAAATGAAAACAAGTAACAAGCGTGAGGAACAAAACATAACTGAAAACAAATTGCAACCAATAATTGCAGCTCTATCAAAGCAAAAAATACCATCAACTGCTAATGATGTCACAAAGCAGGACAGTATACCCAAAACGCCACCAACAGAATTTCAACAAGGAGATGCTAGACCACCAAAACGGCCAAAGAAGAAATCACCATCTGACATGAAGTTTTTAGAGGTAATGCCAATGGAAGAGTTTGCTGTCACGGAACGTGTGGATAAATCTTTACTGAACAGTGGCACCTCTACTCATAAACCCCCACAAAACTTGAAAGATGTAGCAAAAACAGAAGCGGAAGATTATATGAACGACGTTTTGGATCCCATAGTCTTGATAGATGGAAGACCAGCCAAGCAAGGAGCGGCGCTGGATAGGCAAATTACCCTCTTCTACAAAATGGAGTGTTGCATCTGCCACGAAAACGGCTTTCATTTCAAGTCACTCATGAAACATTATAAAGACAGACATGGCGTGCCGGGATATGTTAGCTGTTGTGATAAAAAATTTCACTATTTCTATCCCAAAAAAATCATCGAGCATATGGCTTACCATTTACAACCGAATATATTTATGTGTTCGAGCTGTCATCATAATTTCCAAACATCCCAGCAATTAATTGAGCATCAATCTAATGGTGGCAGATCGGAAGGCAAAATCGTGTGTCCGCGGTGTTCGGAACGATATCCAACTTATCAGGAGCTAGGGTGGCATATTTTAACACACCGGAAAGATAAACTCCAATGCGATTACTGTGGAAAATTATTGAAACATCACCATAGAAAGAAAACCATCAATCATATGGAAGATGTCATTCTGTGTTCACAATGTATACGCTCCTTAAAGAGCATAGAAAAGCAGGAAAAAGAAATTAAGGAAAAAGTGAAAGCTAAAAGTAACGATGCTTTGACTCTACAAAAGTATCAGAAGTTCCGTCAAGCAATGGGATTATCTGCCGACGAAGATGCCTCAACAGATTAA

Protein sequence:

>DPOGS203784-PA
MNDFATLEKTIVRELSCRLCLCTDVSKLKPLNAEVKRKIKRLFDVNIRTDDNLPKAICHECLQQVAALHLYAVKVEKTQKFLDFHKMKTSNKREEQNITENKLQPIIAALSKQKIPSTANDVTKQDSIPKTPPTEFQQGDARPPKRPKKKSPSDMKFLEVMPMEEFAVTERVDKSLLNSGTSTHKPPQNLKDVAKTEAEDYMNDVLDPIVLIDGRPAKQGAALDRQITLFYKMECCICHENGFHFKSLMKHYKDRHGVPGYVSCCDKKFHYFYPKKIIEHMAYHLQPNIFMCSSCHHNFQTSQQLIEHQSNGGRSEGKIVCPRCSERYPTYQELGWHILTHRKDKLQCDYCGKLLKHHHRKKTINHMEDVILCSQCIRSLKSIEKQEKEIKEKVKAKSNDALTLQKYQKFRQAMGLSADEDASTD-