Monarch geneset OGS2.0

DPOGS204414
TranscriptDPOGS204414-TA1776 bp
ProteinDPOGS204414-PA591 aa
Genomic positionDPSCF300002 - 605167-609805
RNAseq coverage1441x (Rank: top 9%)
Annotation
HeliconiusHMEL0062630.076.05% 
BombyxBGIBMGA007719-TA0.068.65% 
DrosophilaHcf-PC7e-6762.50% 
EBI UniRef50UniRef50_E3WQB73e-7161.43%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WQB7_ANODA
NCBI RefSeqXP_318042.48e-7157.21%AGAP004774-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3123828091e-7061.43%hypothetical protein AND_04298 [Anopheles darlingi]
NCBI nr blastxgi|1582978876e-7733.80%AGAP004774-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
InterPro domain[448-566] IPR0089578.9e-11Fibronectin type III domain
Orthology groupMCL22167 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204414-TA
ATGGAGTCGGACTCTGCTCTTCACGGTGATGTTCCCGATGGCGCGGAAATGTCCCCTTTAGAAGAAAACCCAACAGAGAGTTTAGAAGAAAATGGTGATAACAATGGAGCAATTGAGGAATCGGAATCCGCAGCTACAGAAGAAGACGCAGCAACAAATGGTGGTGCTCCAACTAGTAGCAGTGATGTTTTGGATCTAGAACCTGCTGGACAAGCTGTAGAACATGAAGAGCCCCTTCCGCATCAAGCTAATGCTGAAGCTGAAGAAGAAATGGACATCGATGAAACTACCCCGGGAACTGTTGATGAATCAGCATATATCGGAGACAATTGCTTATCTACTCCGGCGGAAACTGAAGACAATTCACCACAAGAGGAGCACTCGTTACTTAAGGATCACATGTTAGAAGGCGAGGGTGATGGAGAGGGTCTTGAAGGAGCCCAGGAGGAGTCTCCCGACCAATCTATCAGTTCGGCTTTACCTATTGAAGGTGACGGAGCCCCTATCATACAGGATGAAGAATCCAGTACAATGGATGAAGATATGGGCGGTGGCGAGGGTGTAGCCAGCAGTGACGATGTCAATGACATAAGCAGTGCTGCAGCAGAAGTTCTAAGCACCGGCATCAGTTCAAGTACCCAAGAAGGTGCAGACATTACAAGCAGTGGGCAAACTGAGGCGGCCCTGATATCATCTACGGCTAACGGCCCTGCATTACTGCATTCCTTCTCTGTACTGCCTCAACAGCAGCAAGCTAATGAATTCAGTGATGCTGATACATCTGAGATGGAAGGTGCCGCGGACACAATGCCCTCAGTCAGTGAGTCAATGCCATTGCTCACTATGACGGCTAATGGATCAGCAATACTTTCACCAAATTTACTCCAAGGTGATGAAAGTGGAGCTGGTGTGTCCTCGTCGGTGGCGGGACTCAGTTCTGAGAGCGGTGCCGGCGAGGGTGAAGGCGCTGTGAGCAGTTCCGGCGCCGCTCAATCCGCCAAAGGCGCCCCACCCCTACCGCCCACAGACGCCGCACATGCGCTCGCTACTCTCGCTAGTGCAGCGCTGCATCACCAACATGAACAGAATGAACCAGAAGACCAGAAGCCACAAAACGATGAGGATGTCTGGTACACGGTGGGCTTTGTTAAAGGAACCACATTCACAGTACAAAATTACATATCCGATGCAAACGTGGATCTGTCGAGTCTCTCCTTGGACAGTCTACCTGACCTGTCCAATTTACCGACCACCCCGCTGGAACACGGCACGGCATATAAGTTTAGAATTGCTGCCATCAACTCGTGCGGGAAAGGAGAATTCAGTGAAGAGGCGGCGTTCAAGACCTGCCTGCCAGGTTTCCCGGGAGCGCCGTCCGCCATCAAGATATCCAAGTCGGTGGAAGGCGCTCACCTCTCATGGGAGCCGCCGCAAGTCGCCGCTGATGGAATCTTTGAGTACTCAGTATACCTGGCTGTGCGATCTAATCCACAACCAAAGGAGGCCTCTAAGTCTCAGTTGGCGTTCGTGCGCGTGTACTGCGGCAAGGCGAACACGTGTGTGGTGGGTCAGGCTTCGCTGGGCGCGGCGCACGTGGACTCCTCCACCAAGCCCGCCATCATCTTCAGGATCGCGGCCAGGAACGACAAGGGATACGGACCAGCCACTCAGGTCAGGTGGCTTCAGGATATAAAATCTACGGGAGTGAAGAGAGCCGGTGAAGGCCGGCTGCCAGGCGCCTCGCCTTCAAAGCAACCAAAACAACTGCTGTACTAA

Protein sequence:

>DPOGS204414-PA
MESDSALHGDVPDGAEMSPLEENPTESLEENGDNNGAIEESESAATEEDAATNGGAPTSSSDVLDLEPAGQAVEHEEPLPHQANAEAEEEMDIDETTPGTVDESAYIGDNCLSTPAETEDNSPQEEHSLLKDHMLEGEGDGEGLEGAQEESPDQSISSALPIEGDGAPIIQDEESSTMDEDMGGGEGVASSDDVNDISSAAAEVLSTGISSSTQEGADITSSGQTEAALISSTANGPALLHSFSVLPQQQQANEFSDADTSEMEGAADTMPSVSESMPLLTMTANGSAILSPNLLQGDESGAGVSSSVAGLSSESGAGEGEGAVSSSGAAQSAKGAPPLPPTDAAHALATLASAALHHQHEQNEPEDQKPQNDEDVWYTVGFVKGTTFTVQNYISDANVDLSSLSLDSLPDLSNLPTTPLEHGTAYKFRIAAINSCGKGEFSEEAAFKTCLPGFPGAPSAIKISKSVEGAHLSWEPPQVAADGIFEYSVYLAVRSNPQPKEASKSQLAFVRVYCGKANTCVVGQASLGAAHVDSSTKPAIIFRIAARNDKGYGPATQVRWLQDIKSTGVKRAGEGRLPGASPSKQPKQLLY-