Monarch geneset OGS2.0

DPOGS204509
Transcript: DPOGS204509-TA; 1815 bp
Protein: DPOGS204509-PA; 604 aa
Genomic position: DPSCF300205 - 196252-207690
RNAseq coverage: 376x (Rank: top 32%)
Annotation
Heliconius: HMEL003609; 0.0; 86.78%
Bombyx: BGIBMGA012513-TA; 0.0; 82.80%
Drosophila: Fas1-PD; 2e-168; 49.58%
EBI UniRef50: UniRef50_P10675; 0.0; 56.41%; Fasciclin-1 n=9 Tax=Neoptera RepID=FAS1_SCHAM
NCBI RefSeq: XP_002426556.1; 4e-176; 52.98%; Fasciclin-1 precursor, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|119882; 0.0; 56.41%; FCN I precursor [Schistocerca americana]
NCBI nr blastx: gi|119882; 0.0; 55.76%; FCN I precursor [Schistocerca americana]
Group
KEGG pathway 
InterPro domain: [423-579]; IPR000782; 1.5e-30; FAS1 domain
Orthology group: MCL15993; Insect specific

Nucleotide sequence:

>DPOGS204509-TA
ATGACACTGTTCGCTCCAACCAACCAAGCCTTCCAAAGATACACCGGAGAAATGGACGATTCGCTTGTTCTTTTTCACATATCAAATTTGGCAACGACACTCACAAATTTGGACTATTCAATATCGTCAGAGCTAGACGGGAACCCTCCTCTGTGGATAACGAGGCGCCGGGACTCTATGCACGATGACATTTACGTCAATAATGCCAAAGTTGACATCACAAGGACGTACCAAGCCACCAACAGAAATGGCAAACAACAGGTGCTTCACGTAATAGACCAAGTACTTCAACCCTTGTTGCCTTTAGGGCAAAGCTCCGTCGGCACAACCGTGTATAACCCTAACGCCTACCAGTTTCTGGAACATTCGGATTTGTTCGATATTGGACAACATCGCCTCAGGTCGTTCCGTCAAAAAGTAAACCAAATCCCAAAGAAGGATGTTTTTGATGCCGACGGCAGACACACGTTCTTCATTCCAGTTGACGAAGGTTTTAAGCCTACAACGCGATCCGATCTGATAGATGGGAAGGTGGTAGATGGTCACGTCATTCCCCACCATGTCCTCTTCACGCAGGCTACTCCCGACGGTGAAGAGTTCAAGAGCATGGCCTTCAGTGATAACGTCAAAGTCATTATCTCCTTCACTTCCACCCCTAACGGCAAAAATGACATCACATACGTGAAATCAAATACAGTCGCCGGCGATGCGAAACATAGTCGTGGCGTTGTGCTGGCAGACATTGTAAGAGCTAACATTCCAGTTAAAAATGGAGTTGTGCATCTCATACAACGACCACTGATGGTGGTCGATACCACCGTCGTAGACTTCCTTAAGGAAAAAGAAGACGGACCTTTGTGTAAATTCTACGAAGTGATCATGGACTTAGGCGAGCACAATCAGTTTCTTAACGAGCTTACTCTCGCCAAAGACATCACACTATTTGCGCCTTCAAACGAAGCTTGGAACGAACCAAATGTGCAAAATATTATAAGAAATCACCAAAAGCTTAAGGATATCCTGAATCTCCATCTGGTGCGGGAGCGACTACCTTTGGACGCAATCATTCACAATAATATGAACCAGATATACCAAGCACCAACCGCACTGCAAAGGAAGTATCTGTACTTCAATGTTATCAATCATCAAAACAATCTGACTCTAACTGTGGAAGGCGGTGGTGTTAATGCGACCGTGACACAGACAAATATTGCTGCAACCAACGGCTTTGTTCATATCATCGACAGAGTTCTTGGTATCCCTTACACTACTGTGTTTGAGAAAATGAAGACGGATCCTATGTTGAACATTACTTACAACATGGGCAAGCGACAGATGTTTAATCAGCAGCTGAAAGAAATGGAGCATCGGTTCACATACTTCGTGCCTCGAGACCACGCCTGGCTCAAGTTCCAAATACGGCATCCCACTGCTTACAAAGCGCTGTTCAGAGAGGATTTCGGATATTACACTAAGCAGATTCTGGAGCGTCACGTCATCCGCTCGGAGCGCTCGTACACCGTCTCAGACCTTAAGCTGCTCGCAAACGAGACTCATCCTTTCGTGCTACCGACAACTCGCGACCCGCTACGCTTACGAGTCAAGGAATCCGATAAAAATTACTACGTTGAATGGAACGGCCATTGGATCCACGTTTTCCGACCCGATGTTGAATGCACTAATGGAATTATTCACGTGATAGACGAACCGTTTGTCCTGGAGAGTGACATTCACGTCACGGGTGGCGTCACTCGTTCGGCTTACACCTTCCCTCTCATTACACCTTTTATAATCGCTATTATCTTGAACAACTAA

Protein sequence:

>DPOGS204509-PA
MTLFAPTNQAFQRYTGEMDDSLVLFHISNLATTLTNLDYSISSELDGNPPLWITRRRDSMHDDIYVNNAKVDITRTYQATNRNGKQQVLHVIDQVLQPLLPLGQSSVGTTVYNPNAYQFLEHSDLFDIGQHRLRSFRQKVNQIPKKDVFDADGRHTFFIPVDEGFKPTTRSDLIDGKVVDGHVIPHHVLFTQATPDGEEFKSMAFSDNVKVIISFTSTPNGKNDITYVKSNTVAGDAKHSRGVVLADIVRANIPVKNGVVHLIQRPLMVVDTTVVDFLKEKEDGPLCKFYEVIMDLGEHNQFLNELTLAKDITLFAPSNEAWNEPNVQNIIRNHQKLKDILNLHLVRERLPLDAIIHNNMNQIYQAPTALQRKYLYFNVINHQNNLTLTVEGGGVNATVTQTNIAATNGFVHIIDRVLGIPYTTVFEKMKTDPMLNITYNMGKRQMFNQQLKEMEHRFTYFVPRDHAWLKFQIRHPTAYKALFREDFGYYTKQILERHVIRSERSYTVSDLKLLANETHPFVLPTTRDPLRLRVKESDKNYYVEWNGHWIHVFRPDVECTNGIIHVIDEPFVLESDIHVTGGVTRSAYTFPLITPFIIAIILNN-