Monarch geneset OGS2.0

DPOGS200970
Transcript: DPOGS200970-TA, 1194 bp
Protein: DPOGS200970-PA, 397 aa
Genomic position: DPSCF300431 - 28497-39427
RNAseq coverage: 603x (Rank: top 21%)
Annotation
Heliconius      HMEL016195        0.0     89.22%
Bombyx          BGIBMGA011057-TA  2e-124  84.80%
Drosophila      mspo-PA           5e-92   40.81%
EBI UniRef50    UniRef50_Q7JY80   6e-90   40.81%  M-spondin n=12 Tax=Drosophila RepID=Q7JY80_DROME
NCBI RefSeq     XP_972867.1       6e-112  51.34%  PREDICTED: similar to GA10108-PA [Tribolium castaneum]
NCBI nr blastp  gi|91078104       1e-110  51.34%  PREDICTED: similar to GA10108-PA [Tribolium castaneum]
NCBI nr blastx  gi|91078104       4e-115  50.97%  PREDICTED: similar to GA10108-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain: [330-382]  IPR000884  1.3e-07  Thrombospondin, type 1 repeat
Orthology group: MCL12214  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200970-TA
ATGACTGTTTATCGAATGGTATTACATACCTATTGGACCAGGGAAAAATTTCCAAAGCACTATCCAGACTGGAGACCCCCTGCTCAGTGGTCGAAAGTATACGGTGTATCCCATAGCCGTTCATACGTGCTATTTCGTCTTGGAAGACGAGTTTCTCCATCAGTCCGTCAGTTTGCTGAATCAGGTAGATCTGATACACTCGGAGCAGCGCCTGGTACCCTTGATGTCTTCGGAGGAGCAGCCATCGGTCAGGGATTGGGCAGAGCCGAGGCTGATTTCTTTGTCGATGGAAATCACTCAAGGGTCTCAGTGATGACAAAGATGATACCGTCTCCTGATTGGTTCATTGGAGTTGACAGCTTTGATCTTTGCGTTGACGGGAATTGGCTGGACAGCATTACTATAGAGGTGGACCCATTGGACGCGGGAACGGACAACGGTTTCACGTTCACAGCGCCGAACTGGCCGACAGCCCCCCAGGGGGTCGCGTATCGCATCACATCCAAGTACCCCTCGCACCCCGCCGGCTCCTTCTTCTACCCCCACCTGAGGAGACTTCCGCCGATCGCCACGTTCCAATTTATTAAGCTACGGGAATACGAGCTCTCGGAAGTATTTCATCGTGACAGTGATGACCGTAAATATGACGTGCTCCAACTGGACAAACTAAACCACAATAATGTTGATGTTCCTGATGGCAACCGAGCACTCTCCATCGCTGAAGAAGTTGAAAGAGCTGAAGCACCAAGGATTTACTCGGACGGATCTCTGACAACGCCGGATAGCTCGTATTCATATTACTCCATTACCAACAACACAGCAAGTACTGAGGAGCCAAGATCAGCAAATGAGGTACCGAATCCTACAGCAAGCGAGCGTTCAGCTCTCCGGAGTCTGGCTAGAAGATACCGAGCTAGAAGACGACGTAAGCTTCGCGGAGACAGTACTGACCCGGCCGAGAGAAGGAAGCGAAAAAGAACAAGGCTCCTAGACTGTCGGGTATCTCAATGGGGTGAATGGAGTCCATGTAGGAATGACGGCAACTGCGTCGGATCAGCTCTAAGAACTCGAAGAATTATCCGTCGTCAACGACCTGGTGGAAGCCCTTGCCCTCCCACAGCGCAGTCCCGATGGTGTGCTACCAACTGCACCACACCACCTCCAATAGAGGACTGGCGAAACAACCTCACGTAA

Protein sequence:

>DPOGS200970-PA
MTVYRMVLHTYWTREKFPKHYPDWRPPAQWSKVYGVSHSRSYVLFRLGRRVSPSVRQFAESGRSDTLGAAPGTLDVFGGAAIGQGLGRAEADFFVDGNHSRVSVMTKMIPSPDWFIGVDSFDLCVDGNWLDSITIEVDPLDAGTDNGFTFTAPNWPTAPQGVAYRITSKYPSHPAGSFFYPHLRRLPPIATFQFIKLREYELSEVFHRDSDDRKYDVLQLDKLNHNNVDVPDGNRALSIAEEVERAEAPRIYSDGSLTTPDSSYSYYSITNNTASTEEPRSANEVPNPTASERSALRSLARRYRARRRRKLRGDSTDPAERRKRKRTRLLDCRVSQWGEWSPCRNDGNCVGSALRTRRIIRRQRPGGSPCPPTAQSRWCATNCTTPPPIEDWRNNLT-
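
The transcript and protein lengths listed in this record can be cross-checked with simple arithmetic. The sketch below assumes that the transcript entry is coding sequence only (it starts with ATG and ends with TAA) and that the trailing "-" in the protein entry marks the stop codon, which is not counted in the 397 aa. The function name is hypothetical and is not part of any database tool.

```python
def cds_matches_protein(cds_len_bp: int, protein_len_aa: int) -> bool:
    """Return True if a coding sequence of cds_len_bp bases encodes
    protein_len_aa residues followed by exactly one stop codon."""
    # A complete CDS is a whole number of codons.
    if cds_len_bp % 3 != 0:
        return False
    # One codon per residue, plus one codon for the stop.
    return cds_len_bp // 3 == protein_len_aa + 1


# Lengths taken from the record: 1194 bp transcript, 397 aa protein.
# 1194 / 3 = 398 codons = 397 residues + 1 stop codon.
print(cds_matches_protein(1194, 397))
```

With the values from this record the check passes, so the reported lengths are mutually consistent under the stated assumptions.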