Monarch geneset OGS2.0

DPOGS208266
Transcript: DPOGS208266-TA (1890 bp)
Protein: DPOGS208266-PA (629 aa)
Genomic position: DPSCF300079 - 122184-129641
RNAseq coverage: 106x (Rank: top 60%)
Annotation
Heliconius: HMEL002139, 1e-12, 31.65%
Bombyx: BGIBMGA006441-TA, 3e-59, 57.38%
Drosophila: (none listed)
EBI UniRef50: UniRef50_B0W3A5, 3e-07, 23.68%, Modifier of mdg4 n=1 Tax=Culex quinquefasciatus RepID=B0W3A5_CULQU
NCBI RefSeq: XP_001843189.1, 6e-08, 23.68%, modifier of mdg4 [Culex quinquefasciatus]
NCBI nr blastp: gi|170030626, 1e-06, 23.68%, modifier of mdg4 [Culex quinquefasciatus]
NCBI nr blastx: gi|170030626, 8e-08, 24.12%, modifier of mdg4 [Culex quinquefasciatus]
Group: (none listed)
KEGG pathway: (none listed)
InterPro domain: [103-163] IPR007588, 5.2e-13, Zinc finger, FLYWCH-type
Orthology group: MCL19124, Insect specific

Nucleotide sequence:

>DPOGS208266-TA
ATGAGAAATTTAAAAAAGAACGGCGCGTACTGGACTTGTTCTAGTCACCATAAGAAAGGCTGCAAGGCTAACGTCACAACAGACGACAATAAGAAACAATTAAGCGTCCTTAATACAAAACACAACCATCCTCCCCCTGATTATCCGTATGCTAGGAACAGCAAGAACGGTTGGACTTGCTCGTCCCGTTCCTCCAAGAAATGCAACGCTCTCTTAAGATTGTCAACTAAGGGAGTTCTCACTGTGATTAACTCGGAACACACGCACGAACCTCCACTATTCTATATAAACGAACACGAATTCGAATTTATTAAATCATCAAGGGGCACAAATGTACTTTTATACAACAGATACACGTACGCCAGGAATAACGCGAGTAGAGCCGGGGTGTCGTACGCTTGTTCGTCTCGTAGTTCCAAAAAATGCGGTGCCCAGGTCTTCTTAACAAACGAGGGTGTCATATCAGTGGTGAAAGAACAGCATAGCCACGAACCACCCAGGTTTTATATTATCAAAGTGGTAACAGCTGTAGACGGTGCGACTCTACTGATGCTAGATGGCTACGTCTTCACGAATCCGTCTCCCATGTCGGGAGGCGAGCGTTGGTACTGCTCCGGTAGAGTCAAGTGGAAGTGCGCTGTCTGTTTACATGTCAACGATGACTATGAACTCGTTTGCATATCCCATAAACACGGACATGAACCCCCTGTATATGAGATCACGCCTAGGCTGCGTGAATTTGACAACGGTAGGAAGAGGTACTGCCTGGGTGGGTACACCTTCTATAAGCACAAGCCCATACTGCGAGGGGATGCCAGTCGTTGGCTGTGCACCAGGCGGCAATGCCCTGTGTATTTACACCTGGATAGCGATCTGAACCTTCTTTACCGACCTTCCAGAGAACACCCGCATCCGCCCGTTTGTATTTACAGGAACGCAAGCGGGCGTGTCGATGAGGCAACGACGAGCACTTCCACTGGTCTAAGAGGTCCACGACGGAAAACTGCACAATATTACCGTGATTACAGAGCACGGAAGCGAGCGGAGCAGGAAAAAGAGTCGTTGTATAGAACTAGTGATCCTTCTACGACTGCTGATTCTTTTAGGAAGAAAAGAAAATCAGCCGCGGAATACCAGAGAGAGTACCGAGCACGACAAAAAGCCAAACGTAACAATATGCTCATTGATTCACTAGCTGTACTTCCATCAACTTCGACGGGGGGATTGACTAGCACTCATCAATCAACTACAGTAGGCCAACTAACAACTACTGGTTACGGCGGTGACTCTGTCGAACGGGAACACGTTCCTCATAATTGGTGGCCACACCTTCAGGAAAGAGATGAAAGCGAAACATTCACAGCGGAATATCTAGAAGACGACGCTAACGAGGGTATTGCGATGTTTCTAGAGACAAGCAAAGGCAGGACAGTCTTACAATATGACGGGTATAGATATCGTAAGGCATACAGATCTAAGAATGGCACTCGCTGGAATTGCACCAGTAAGAACTGTTCAGCGTTCGTTTATTTGAATGACCAAGACGAGATCATAATGACTTCCAAGTTCCACGATCATCAACGCTGGAATTCCATAGCAGAGAGTGTTGATCCAGATCTAAGTAATACAGCTGTAGTAATAACCTCGCGGAAGGGCAAAGAGATGCTTCTGTTCCGTCAATACACTTACAGAAAGCAATATGATACGGGCCTGAAAACAAGATGGGTGTGCTCTACTTTAAAAAACTGTCGCGCCTGCGTTTTCACAGACTCCAACAATCTTATAACATCTGCGTTCGAAGAACATGAACACGACCCACCTAAATATTATCTTAATCCTTCCCACATTTTGGAAGCTTTACGGGAGCCTATTGTATTTGAATCCGACTAA

Protein sequence:

>DPOGS208266-PA
MRNLKKNGAYWTCSSHHKKGCKANVTTDDNKKQLSVLNTKHNHPPPDYPYARNSKNGWTCSSRSSKKCNALLRLSTKGVLTVINSEHTHEPPLFYINEHEFEFIKSSRGTNVLLYNRYTYARNNASRAGVSYACSSRSSKKCGAQVFLTNEGVISVVKEQHSHEPPRFYIIKVVTAVDGATLLMLDGYVFTNPSPMSGGERWYCSGRVKWKCAVCLHVNDDYELVCISHKHGHEPPVYEITPRLREFDNGRKRYCLGGYTFYKHKPILRGDASRWLCTRRQCPVYLHLDSDLNLLYRPSREHPHPPVCIYRNASGRVDEATTSTSTGLRGPRRKTAQYYRDYRARKRAEQEKESLYRTSDPSTTADSFRKKRKSAAEYQREYRARQKAKRNNMLIDSLAVLPSTSTGGLTSTHQSTTVGQLTTTGYGGDSVEREHVPHNWWPHLQERDESETFTAEYLEDDANEGIAMFLETSKGRTVLQYDGYRYRKAYRSKNGTRWNCTSKNCSAFVYLNDQDEIIMTSKFHDHQRWNSIAESVDPDLSNTAVVITSRKGKEMLLFRQYTYRKQYDTGLKTRWVCSTLKNCRACVFTDSNNLITSAFEEHEHDPPKYYLNPSHILEALREPIVFESD-
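
The transcript and protein records above should agree: 629 residues plus one stop codon account for 3 x 630 = 1890 bp. The following is a minimal Python sketch for checking that kind of consistency. It assumes the standard genetic code (NCBI translation table 1) and that the trailing "-" in the protein record marks the stop codon; the names `translate` and `CODON_TABLE` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G at each position; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    matching the end of the protein record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First ten codons and last four codons of DPOGS208266-TA.
print(translate("ATGAGAAATTTAAAAAAGAACGGCGCGTAC"))
print(translate("GAATCCGACTAA"))
```

Passing the full DPOGS208266-TA sequence to `translate` and comparing the result with DPOGS208266-PA would extend the same check to the whole record.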