Monarch geneset OGS2.0

DPOGS212597
TranscriptDPOGS212597-TA1365 bp
ProteinDPOGS212597-PA454 aa
Genomic positionDPSCF300245 - 205921-208263
RNAseq coverage627x (Rank: top 20%)
Annotation
HeliconiusHMEL0067864e-14873.17% 
BombyxBGIBMGA005183-TA4e-12667.71% 
Drosophila% 
EBI UniRef50UniRef50_D1ZZG82e-0927.27%Putative uncharacterized protein GLEAN_07458 n=1 Tax=Tribolium castaneum RepID=D1ZZG8_TRICA
NCBI RefSeq%
NCBI nr blastpgi|2700054076e-0927.27%hypothetical protein TcasGA2_TC007458 [Tribolium castaneum]
NCBI nr blastxgi|2700054073e-1428.52%hypothetical protein TcasGA2_TC007458 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL13526 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212597-TA
ATGCTGATGGTACATACTAACACTCAACCTATGAAGACCACGAAAATTAAAGGACCTCCACCTGTTCCTCCGCGACCCAGTCAGAGTATGGTGGCCGAAGCTCTGGCGAAGACGAGGAAAGCGGTCGCGGATTCCAAAGCTACGCTGAAAAGTCGCACTTTCACCAAAAATGAAATGACCCATGTTTCCTCGAGGGCCAAAACATTGGATAGGAATACGACGTCGATCAAACCAACCGTGTCGCCTCGACAAAATGGACTCGCGCGCGTCAAATCATTCATTAAAGACGTTATCAGTGACCGCAGCTCGTCATCGGACGAGAGGAAATCCAGCGGAGCGAACTCCAGGAGGAGTTCCAGCGACAGCAGCTCTATACAATCACCTACGTCGAATAAATCAAATGAATCTCTTAACTCCAAAGCCGTTAAAACATGTAGACAAATATTGATCCGATCTCTTTCTACGTCTAATAAATTAGATCAAGACACCAAGTGCAAGGTATCAAAATCATCTAGCTTCGCCGAAAAGACGCTTCCTTTACGTAAGGCACCACCGCCACCGCTGTCACCTAAACCTAAGCTGAAACCGATGAAACAACTTCCCGCACAGAAAACGAACACAGAACCTAGAAAGCAACACAAGGAACCGTGCGTCGCTCTTTATTCATTACCGGTGGATGCAAAGCCTATTGAAACGGCGGTCAGTGAAGCGTGCTACGCGACGGTCGAAGATATAGCAAAAGTTGAAGACAGACTACAGGTAGAAGAAAAAACGATAGCCAAAAACGAAAAAGACCAAGAACAGGAAGACATTAATAAGAACACGTTAGAAAGAAAGACCTCCAGGGTAACAGAGATGCTCATTTCGGAGATTCTGGCCAGCAGAAACAATAACAAAGACGAAGCTAACGCTGTTATAGATAAAGGGAAAGATTCCGAGAACACAAACGGGAATGAAGCTGAGAGTGAGAAAATGAAGGACATGAATAATCACGAAATGTTGATATATGAATTACAAACGATGAGATCACAGACCAACTTACCAGAGACTGTAGATGTCAAAGATGACATGCATTGTGACTCGGAGGAGGACAATTACGGTGAATCGGGTAACAGTGAATACGTGGAGGACGATGAAGCGTACGCCAGCATCAGAGACGACGACGACCTCAGATCTAGACTACGTAAGCAGTTCCGTGCTATAAGCACTATGTCGCTCCAAGGATTGCCTCCACTGCCGAAGAGTCTCAGCGGCTTCGCTGACGGCGACGATCTCTTGGAGCCTCCCGGGCCTGCACAGTCACAGGAACCCCCCACTGACCTTGACTCACAGTTGGTCTATTTGAAGAGAGAAATGGTAAGCTAA

Protein sequence:

>DPOGS212597-PA
MLMVHTNTQPMKTTKIKGPPPVPPRPSQSMVAEALAKTRKAVADSKATLKSRTFTKNEMTHVSSRAKTLDRNTTSIKPTVSPRQNGLARVKSFIKDVISDRSSSSDERKSSGANSRRSSSDSSSIQSPTSNKSNESLNSKAVKTCRQILIRSLSTSNKLDQDTKCKVSKSSSFAEKTLPLRKAPPPPLSPKPKLKPMKQLPAQKTNTEPRKQHKEPCVALYSLPVDAKPIETAVSEACYATVEDIAKVEDRLQVEEKTIAKNEKDQEQEDINKNTLERKTSRVTEMLISEILASRNNNKDEANAVIDKGKDSENTNGNEAESEKMKDMNNHEMLIYELQTMRSQTNLPETVDVKDDMHCDSEEDNYGESGNSEYVEDDEAYASIRDDDDLRSRLRKQFRAISTMSLQGLPPLPKSLSGFADGDDLLEPPGPAQSQEPPTDLDSQLVYLKREMVS-