Monarch geneset OGS2.0

DPOGS202937
TranscriptDPOGS202937-TA1773 bp
ProteinDPOGS202937-PA590 aa
Genomic positionDPSCF300220 + 106339-113908
RNAseq coverage265x (Rank: top 40%)
Annotation
HeliconiusHMEL0178213e-17551.34% 
BombyxBGIBMGA001906-TA2e-14145.99% 
DrosophilaCG6232-PB9e-5028.45% 
EBI UniRef50UniRef50_F4X3W65e-7132.15%Thrombospondin type-1 domain-containing protein 4 n=4 Tax=Acromyrmex echinatior RepID=F4X3W6_ACREC
NCBI RefSeqXP_393459.32e-6032.35%PREDICTED: similar to thrombospondin repeat protein 1 [Apis mellifera]
NCBI nr blastpgi|3763192603e-12943.25%thrombospondin type-1 domain-containing protein 4-like precursor [Bombyx mori]
NCBI nr blastxgi|3763192602e-13843.59%thrombospondin type-1 domain-containing protein 4-like precursor [Bombyx mori]
Group
Gene OntologyGO:00310126.8e-27extracellular matrix
GO:00042226.8e-27metalloendopeptidase activity
GO:00082334e-08peptidase activity
KEGG pathway 
InterPro domain[35-144] IPR0102946.8e-27ADAM-TS Spacer 1
[560-590] IPR0109094e-08PLAC
[200-263] IPR0008847.8e-08Thrombospondin, type 1 repeat
Orthology groupMCL44095 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202937-TA
ATGGCGCCTCTATCGAGATTATTCATCATTTTAGTGATTACCGTGGTAGGTGGTGAGGTACTGTCGGTGGTAGGTAGTCGCGATGTTCGCTGCGGGCGGCGGCTGGTATCCGGTCTGTTCGCACGGCCACGTCTACCACTCGGCTACTCCTATGTTACCACCGTCCCCTCTGGCGCCTGTCGACTCAACGTCTCGGAGATACTCGCCAGCGATAATTACATCGCTTTGAAAATAACGAATGGTTCGTTTATAATGAACGGTGAATTCGCCGTCAGTAGTCCTGGTACATACGAGGCAGCTGGTGCAAGATTCGTTTACAGCAGAAAAACAGGTCTAGATTCAGTGTATGCACTTGGACCTACCCACGATTCTATCGATATTATGATATTATACACTCAACCAAATCCAAGTATAAAATACGAATATTTCACCGAATCGTTGCCAGGTGAAGTTGAAACTGAATCATTGACAGTGTCCCCACCAGAACCAACTGTCGTACCCAAACATTCGAGACGTCATCACGGTATAGAATACGCTAAAGCAGGAGCCCGGCATTTGGATCCAGGAGTCAAAGATAAAAACAACGTTGAGGAAAATGTCGTAGCTGGAAGAAAATTTGTATGGAAGATACTTGCGTATACTCAGTGTTCTAGAAGCTGCGGTGGTGGTATTCAGCTAGGAAAATACAGGTGCGTAGAAGTATCCTCTAGTGATTGGGAGGTGTCCCCAGCACATTGTTTGGGTTCCCCTCCGTCAGGTAGACGTCGTCGTTGTGGAACCATTCCTTGCGCTCCGAGATGGCGGGCCGCTAGCTGGTCTCCATGCCCGTCCTGTGGACCAGCGACAAAGAATAGGATCGTTGGATGTGTACAAGATCATTCAAGAGGAATTACTAAGGTAAGCGATCAAAAATGTTTGGCTTCAAAACCGGCGACCACAGAAGATTGTAACATCCCAGATTGTAAAAACCCTGGAATACGGCACACAGAGGCGAAGCCCCAGGAACATACAGATGCCTTCCACGATGGTTCAGTGTACACAGTCGATGTTAATACCACGGACACGGAACTTGGACCAGAATATAGTTTCACTTCCATTAGAGGATGGCTTTTCACCGATTGGTCTGAGTGTGTAGGATGGTGTGTAGGCGGTGGTTTGAAGACCAGGTCTGTTCGATGTGGTGATCCCTCAGGTTGTGCAGGACCCTCCCCGGAGACGTCTCAAGACTGTGTCCCTTCAGTGACATGTGAACCCCACGATGGCCGCTGGTTTGCAGGGGACTGGTCGAAGTGCTCGTCCCCTTGCGGGAAGCAGATCCGAGTGGTGTTATGTATCGGAGGTACCGGAAGGCATCTGAGGGACTCCGCCTGTAGGGACCCTCGGCCAGAACACGAGAGGAACTGCCCCGGAGAATGCCCAGCGACGTGGTATTACAGCGAATGGGGTCAGTGTACAGGTAACTGTAGTATTGGCCTGGGCGTCCAACGCCGATGGGTGTCGTGTGTGAGGAATGATGTCACCGTCAGCGAAACTGAGTGTACGACACCACCACCGACACCACACAGATCCTGTATCCCGTCATGTATACCACCAGATCTCGTTATAGAGTCTCAAAAATCAACGAACGATCAATCGACAATGAAGCCGAGACCACAAACAGTTCCATCGGGGAAAGACTGCGAGGACAAATTGACGAACTGCGCTCTAGCCGTACAGGCGCGACTGTGCCATTACAAATACTACATCCACAACTGCTGCGATTCTTGTAAATAA

Protein sequence:

>DPOGS202937-PA
MAPLSRLFIILVITVVGGEVLSVVGSRDVRCGRRLVSGLFARPRLPLGYSYVTTVPSGACRLNVSEILASDNYIALKITNGSFIMNGEFAVSSPGTYEAAGARFVYSRKTGLDSVYALGPTHDSIDIMILYTQPNPSIKYEYFTESLPGEVETESLTVSPPEPTVVPKHSRRHHGIEYAKAGARHLDPGVKDKNNVEENVVAGRKFVWKILAYTQCSRSCGGGIQLGKYRCVEVSSSDWEVSPAHCLGSPPSGRRRRCGTIPCAPRWRAASWSPCPSCGPATKNRIVGCVQDHSRGITKVSDQKCLASKPATTEDCNIPDCKNPGIRHTEAKPQEHTDAFHDGSVYTVDVNTTDTELGPEYSFTSIRGWLFTDWSECVGWCVGGGLKTRSVRCGDPSGCAGPSPETSQDCVPSVTCEPHDGRWFAGDWSKCSSPCGKQIRVVLCIGGTGRHLRDSACRDPRPEHERNCPGECPATWYYSEWGQCTGNCSIGLGVQRRWVSCVRNDVTVSETECTTPPPTPHRSCIPSCIPPDLVIESQKSTNDQSTMKPRPQTVPSGKDCEDKLTNCALAVQARLCHYKYYIHNCCDSCK-