Monarch geneset OGS2.0

DPOGS202925
Transcript: DPOGS202925-TA, 1869 bp
Protein: DPOGS202925-PA, 622 aa
Genomic position: DPSCF300220 - 116504-123464
RNAseq coverage: 447x (Rank: top 27%)
Annotation
Heliconius      HMEL017821        0.0    76.96%
Bombyx          BGIBMGA001913-TA  0.0    60.81%
Drosophila      CG6232-PB         5e-54  30.25%
EBI UniRef50    UniRef50_F4X3W6   1e-80  32.81%  Thrombospondin type-1 domain-containing protein 4 n=4 Tax=Acromyrmex echinatior RepID=F4X3W6_ACREC
NCBI RefSeq     XP_973307.1       2e-68  32.46%  PREDICTED: similar to ADAMTS-like protein 4 precursor (ADAMTSL-4) (Thrombospondin repeat-containing protein 1) [Tribolium castaneum]
NCBI nr blastp  gi|376319260      0.0    60.81%  thrombospondin type-1 domain-containing protein 4-like precursor [Bombyx mori]
NCBI nr blastx  gi|376319260      0.0    62.37%  thrombospondin type-1 domain-containing protein 4-like precursor [Bombyx mori]
Group
Gene Ontology
GO:0031012  3.8e-28  extracellular matrix
GO:0004222  3.8e-28  metalloendopeptidase activity
GO:0008233  9.3e-09  peptidase activity
KEGG pathway 
InterPro domain
[33-140]   IPR010294  3.8e-28  ADAM-TS Spacer 1
[484-551]  IPR000884  7.6e-10  Thrombospondin, type 1 repeat
[589-620]  IPR010909  9.3e-09  PLAC
Orthology group: MCL16236 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202925-TA
ATGTTCAAGTTTGCTCTATTATGGAAAGCAGTGGTAGGTCGTGAAGAGGTGCTAGCATCGACTGCGACAAGAGACGTCCAGTGCGGGCGTCACCTTGTTTCTGGTCTGTTCTCCCGACCTCGTCTTCCCTTGGGATACTCCTCCGTCACCACAGTGCCGACCGGCGCTTGTAGACTTAATGTCTCCGAAGTTGTCGCCAGCGAAAATTACATCGCACTAAGAGTATCCAACGGTTCATATATTGTAAACGGTGAGTTTGCTGTTAGTGCTCCTGGTACGTACGAGGCGGCGGGTGCAAGATTTGTATACTCTCGGTCTGCCGGACTAGACAATATATTTGCTATGGGACCCATACACCAACCTATTGATATTATGGTTTTATATACGCAACCAAACCCGAACATAAAATACGAATATCTTACCGATTCGCTGCCTAATGAAACGAGCAACGACATTCCGAATGATATCACCAGTTCTGCCCATCATCAGATGCAAAGGCATCACCGCCATCACGTGGCAGAATCCCACACGAGGGCCTTAGAAAAGTCAGGACCATCTTTGTCGAAGCAACCAGAAGCAAGTTCTTCTGAAACACAATATCAATTGGACACGAATGTTATTGGCGATAGAATATTTGTATGGAAAGTAATGTCATACACGCAATGTTCAAGAACTTGTGGTGGCGGCCTTCAAATTGGGAAATTTAAATGTGTCGAAGTAGACGGCAAAGAAGACCGCGAAGTGTCTGCTGCTCATTGCAAAGGGTCTCCACCGCCGTCTCGACGTAGAAGATGTGGCGTTGTTCCTTGTCCCCCTCGATGGAGGGCTGCCGCTTGGGGTCCCTGCCCTGTTTGCGGTCCAGCTCATAGGACTAGAATAGTTGGATGTGTACAAGATCATGCTCGAGGAATCACAAAGATAAGCGATCAAAAATGTCCTCTACCTATGCCACCAAGTAAGGAACCTTGTGATATACCGAATTGTGATGGAACTGCTAGAACAGACAGCACACCACGTCTCGATGGGAGACGTCACGTAAATCCTTACGATCGTGTTGATGCATTCCATGAAACAAAACCAAAGGTGACACTCGCTAACAACGTATCTGCTATCGAGGTTCATCCAAAACATGAATCCAAGTACACTCACAGTGCAGCTGGTGGATGGCTGTATACTGAATGGTCTCAGTGTGTCGGTTGGTGTGCGAGCGGTGTTCAGTCCCGCGGCGTCCGTTGCTCGGAGCCTGATGGATGCTCCGCCCACGCCCCGGCCTCTTCGCAGGCCTGCTCAGTAAAGAAACCATGTGAGGCGCAATGGTTTACTGGAGAGTGGTCTCCATGTACCGCTGAATGTGATGGAAGACAAATAAGAGGAGTCATTTGTATCGGAGGAAACGGAAGGAGATTGAGAGATGGTTCCTGCAAAACACCAAAACCTGAAGCCGAAAGACCATGCGGTGGAACCTGTGCACCGTCTTGGTATCTCAGTGACTGGGGGGAGTGCACAGGTCCATGTGAGGCTGGTATTCAGACTCGTACTGTGTGGTGTGCTCGTGGCGGGGCGGGGGGTGCTGGTGGCGCTGGCTCAGCCAGAGATCTGGAGTGCGGCGGAGGTAGACCACCAGCTAGGAGGTCCTGTGTACCAGCTAGGTGCACCACTAGACCGGTGAAGCCGGCCGTGATCACACCCACAGACCCGCAACGAGTCACGAGACATCAAACTCAGAACACGCAGAACGTCAAGGAACGGTCGTATAGAAACGGAACGTGTACCGACAAGTTGAGCAATTGCTCGCTGGCGGTGCAAGCGCGTCTCTGTCACTACCGCTACTACGCTGATAGCTGCTGTCGCTCCTGTCACGGACGGTAG

Protein sequence:

>DPOGS202925-PA
MFKFALLWKAVVGREEVLASTATRDVQCGRHLVSGLFSRPRLPLGYSSVTTVPTGACRLNVSEVVASENYIALRVSNGSYIVNGEFAVSAPGTYEAAGARFVYSRSAGLDNIFAMGPIHQPIDIMVLYTQPNPNIKYEYLTDSLPNETSNDIPNDITSSAHHQMQRHHRHHVAESHTRALEKSGPSLSKQPEASSSETQYQLDTNVIGDRIFVWKVMSYTQCSRTCGGGLQIGKFKCVEVDGKEDREVSAAHCKGSPPPSRRRRCGVVPCPPRWRAAAWGPCPVCGPAHRTRIVGCVQDHARGITKISDQKCPLPMPPSKEPCDIPNCDGTARTDSTPRLDGRRHVNPYDRVDAFHETKPKVTLANNVSAIEVHPKHESKYTHSAAGGWLYTEWSQCVGWCASGVQSRGVRCSEPDGCSAHAPASSQACSVKKPCEAQWFTGEWSPCTAECDGRQIRGVICIGGNGRRLRDGSCKTPKPEAERPCGGTCAPSWYLSDWGECTGPCEAGIQTRTVWCARGGAGGAGGAGSARDLECGGGRPPARRSCVPARCTTRPVKPAVITPTDPQRVTRHQTQNTQNVKERSYRNGTCTDKLSNCSLAVQARLCHYRYYADSCCRSCHGR-
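
Consistency check (illustrative):

The transcript and protein entries in this record can be cross-checked against each other. The sketch below is not part of the database record. It assumes the standard genetic code, assumes the transcript is a complete coding sequence in frame from its first base, and assumes the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `expected_cds_length` are hypothetical helpers introduced here for illustration only.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stops as stop_symbol.

    The "-" default mirrors the assumed stop marker in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(n_residues, include_stop=True):
    """Length in bp of a CDS encoding n_residues amino acids."""
    return 3 * n_residues + (3 if include_stop else 0)


# First 30 and last 12 bases of DPOGS202925-TA, copied from the record.
print(translate("ATGTTCAAGTTTGCTCTATTATGGAAAGCA"))  # MFKFALLWKA
print(translate("CACGGACGGTAG"))                    # HGR-
print(expected_cds_length(622))                     # 1869
```

Under these assumptions, 622 residues plus one stop codon account for exactly the 1869 bp reported for the transcript, and the translated ends match the start (MFKFALLWKA) and end (HGR-) of DPOGS202925-PA.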