Monarch geneset OGS2.0

DPOGS210653
TranscriptDPOGS210653-TA2295 bp
ProteinDPOGS210653-PA764 aa
Genomic positionDPSCF300401 + 75916-89653
RNAseq coverage175x (Rank: top 50%)
Annotation
HeliconiusHMEL0107960.065.58% 
BombyxBGIBMGA001798-TA0.057.48% 
DrosophilaCG31900-PA3e-3734.53% 
EBI UniRef50UniRef50_Q7QBZ93e-4928.94%AGAP002382-PA n=1 Tax=Anopheles gambiae RepID=Q7QBZ9_ANOGA
NCBI RefSeqXP_001844278.13e-4742.14%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|3479677441e-4828.94%AGAP002382-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3123770903e-5727.85%hypothetical protein AND_11738 [Anopheles darlingi]
Group
KEGG pathway 
Orthology groupMCL19956 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210653-TA
ATGAACCGCAGCACGACACTCAAATCCTGGCTCGAAGACTCCTGGCTGAGGCCACCAGCCGGCATTCTGGTGCCGCTGAGGCCCTTGGCGCTGAATCGGGCTTTGGGCGTTTGGAACGATTTGGCCAATGAAGGCTTAAATTTGACAGACATCGTCATAGTCGGCTATGACTCCAATGGCGTTAATTGGAGATCCAGGCACAACCTCCAAACGTCCAGCGGAACTAATGGAGATAGGGCTGTCGGGGACGCTTTGTCTAAACTGTTATTGAATTACCAGGACGTTTACACTGATAGTTCGAACGACGGCACGATGAGGGCATTGGCTTCCGCTGCGAAACTTGTGCCGTATGACAGTGCTCTGTTCCTGGTGACTGACAAAGGAGCTGGTGACCCTCAGAGACTGCCTCTAGCTTTGAGGGCGTTAGTGGAGAAAAGGTTGAAGGTGTACACTATATGGACGGATCCCAGTCATCCATCAGTTGAGTCAGAGCTGGCACTGCAGGACTTGAGGAACATCTCCTCGCACACGGAAGGCGAAGTGTTGGCCTATTCATTACAAGTTATAGATGAAGACAACTCCAATCTGGCCTCAGAAGTTGAACTCCAACCATGGGAGCCAGTTTCATCTAATCAAGCCAGACGAGCTAGGATCAATAACCATCCGGATGTGGAAAACTTTGATACTCTATTGGTGAGACGTGGTAGAGCTGAAGCCCTTACCTTGGGAATACCAGTTGAAACAGGAGTGACAGCACTCCGTCTTCTGCTGGAAGGGGCGATTGATCACGCCGTATTATACCCACCCAACGATGGTCCTCAGATTGACCTGTACAATGAGACGTCAGTGAAGATGTATTCAGAATCGTCGACTATTGAGAGCATATCGCCACAGGAAGTCTATATCGTTGTCCCAGGATGGAAACTGTATGTAGACATGCTGTCAGTGTTGCCGGTGATGTCGGCGGGTGAGGACGTCGCTATGACCGGCATGTGGCACCTGAGTGTCAGGTGTGACACTTGCGATTACAGACTAACAGTCAGCGCTAGATCACACATACATTTCGACGTTGAATTCGATACTGAAGATTCGTTGAACATCAAAGTCACTGGGAATGTTGCTAGTGTGAGAGACTCCTCTGTTGTCGACGAGTATGGAGCGACTATCGAAAAACTATCATTCAGCTATCAACCATCAACGGAGGCTTTCGAAAACAAAATGGCGGATATCTTTACCAGTGTCCCAGTGAGAAATGTCTCGGGAAACAGGGCCTATGTCAAAATAGCGGGAAGAGATTTCAAAGGCGAAACTTTCGCACGAGTCGCTGGACCTATACACGGTGAATCTGAAGTGAGAATGGGAAGATCGGCTGCCATCGTCTTCCCAGAGAGCATCAACGATCTAGAGGTGGCTGAGGAGATGAACTCTAAGACGTATAACGAGAAAATACAGAATGAGAGCGATGTGCTGTTCCAGTCTGAGGTGCAATTACAACGTAATCCCGCTATGACAGCTGTTCAAATCGGTCTGAGTTCAAGACTTTACGGCTCTCCGGAAAGCAGATTACAGCTTCATTTCGAAGTCACTAACTTGAGGGATACATCCGTGTTCTATCGTTTCGGTGCTGCTGGTGAACTGAGGTTCTTGACCGGAATTAATCCAGAAACTCAAACAGTGGCTGCTGGTCAAACCGTCAACGTCATAGTCAGCTTATTGATAGCAAGCAACGCTCAGGTCGGAGCCAGGGACCTCATCAAATTCACCGCTTATGGTCAATCCGAACAAGTATCCCTATCTTCGTACGTGTACGTGGTGAGCAGCGGAGACAACATCAGAGACCTCACTCCGCCCGATGTCAGACATAACTTCCAAGGCACATGCCTAGGCAGACTGGGGAGCGACTGTGCTGAACACGTCTGGTCTACATCTGTGATCGCCAGAGACGCCATGGGAGGTCTTCTTCGGCTATCATCAACGCCAGCAGGTATATCCTACAACGCTGGCTTCATATCAGGTTCCAGGGATGAGATCATAGCCACGTACAGAGCCAACTGCTGCGCCCCGAGGGTTGTTGTAAATGCGGTAGACGCTTTTGGGAACACCAACACTTATACCATTGACATATCAAATTACATTAATGAAGCAACGATAGCCGCTATAGCCCTAGGTGTTATATTAATATTAATTTTAATATTTCTTATAATAATACTGACGTACTTCTGCGTAAAGAGAAGGAAGGAGTCCAGAGAGCTACCGACCTACTCAACTTCCAGATCATCGAGAAACATTACTTGA

Protein sequence:

>DPOGS210653-PA
MNRSTTLKSWLEDSWLRPPAGILVPLRPLALNRALGVWNDLANEGLNLTDIVIVGYDSNGVNWRSRHNLQTSSGTNGDRAVGDALSKLLLNYQDVYTDSSNDGTMRALASAAKLVPYDSALFLVTDKGAGDPQRLPLALRALVEKRLKVYTIWTDPSHPSVESELALQDLRNISSHTEGEVLAYSLQVIDEDNSNLASEVELQPWEPVSSNQARRARINNHPDVENFDTLLVRRGRAEALTLGIPVETGVTALRLLLEGAIDHAVLYPPNDGPQIDLYNETSVKMYSESSTIESISPQEVYIVVPGWKLYVDMLSVLPVMSAGEDVAMTGMWHLSVRCDTCDYRLTVSARSHIHFDVEFDTEDSLNIKVTGNVASVRDSSVVDEYGATIEKLSFSYQPSTEAFENKMADIFTSVPVRNVSGNRAYVKIAGRDFKGETFARVAGPIHGESEVRMGRSAAIVFPESINDLEVAEEMNSKTYNEKIQNESDVLFQSEVQLQRNPAMTAVQIGLSSRLYGSPESRLQLHFEVTNLRDTSVFYRFGAAGELRFLTGINPETQTVAAGQTVNVIVSLLIASNAQVGARDLIKFTAYGQSEQVSLSSYVYVVSSGDNIRDLTPPDVRHNFQGTCLGRLGSDCAEHVWSTSVIARDAMGGLLRLSSTPAGISYNAGFISGSRDEIIATYRANCCAPRVVVNAVDAFGNTNTYTIDISNYINEATIAAIALGVILILILIFLIIILTYFCVKRRKESRELPTYSTSRSSRNIT-