Monarch geneset OGS2.0

DPOGS208547
TranscriptDPOGS208547-TA3786 bp
ProteinDPOGS208547-PA1261 aa
Genomic positionDPSCF300064 + 995454-1010128
RNAseq coverage896x (Rank: top 14%)
Annotation
HeliconiusHMEL0087650.068.07% 
BombyxBGIBMGA010335-TA0.066.16% 
Drosophilaed-PA0.048.84% 
EBI UniRef50UniRef50_Q9BN170.048.84%Echinoid n=20 Tax=Diptera RepID=Q9BN17_DROME
NCBI RefSeqXP_002087894.10.048.91%GE18270 [Drosophila yakuba]
NCBI nr blastpgi|2700136320.054.52%hypothetical protein TcasGA2_TC012257 [Tribolium castaneum]
NCBI nr blastxgi|2700136320.054.52%hypothetical protein TcasGA2_TC012257 [Tribolium castaneum]
Group
Gene OntologyGO:00055151.5e-10protein binding
KEGG pathwaycfa:4034403e-29 
 K06255 (HSPG2)maps-> ECM-receptor interaction
InterPro domain[657-731] IPR0137831.1e-23Immunoglobulin-like fold
[724-860] IPR0089578.1e-19Fibronectin type III domain
[741-835] IPR0039611.5e-10Fibronectin, type III
[155-236] IPR0131622e-09CD80-like, immunoglobulin C2-set
[644-733] IPR0130985.3e-09Immunoglobulin I-set
[343-426] IPR0035991.3e-08Immunoglobulin subtype
[349-408] IPR0035981.5e-06Immunoglobulin subtype 2
Orthology groupMCL11173 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208547-TA
ATGATAAAAAGGCATGTCGTCCGCAGTCCACGGGCTGGCTGCGCGTGCGACGACTCACTAACTCGACCCCACGTGTCATTGAACCCGCTTCTAGTCGACTACGTTATACTGACAATCCTGTGCGAGGACGAGCAGCACGATTCTAGGGAGGGTGACGATGTGTCGATGCAGTGTCGTTTCGACTTGGAGCCCGTGGCTGGAGAGTCTCTCACGTACTACTGGGTCAGGAGCACCTCCAGCGGTCATGACAATGTTGCCATCGGTAACATACCCCTGGAGACTAACTACCAGATAAACTACGCTCCATCGGAAGGTAAGTTCGACTTGCTGGTGTCCAATGTGACATACGAGCGTGACAACGGTCGGTTCGAGTGTCGAGTTAAGGCGGGCGGGTCGGGGCGGACGTTACATTCTCAAGGACATAGTCTGGTGGTCCTCACACAGCCCCAGCCTCCGATCCTCAACCCTGGACCTCAGGCCCAAGCGCAGGAAGGCAGGCAACTCAACCTGACATGCTCCAGCACCGGCGGATCCCCCAGTCCGTATATAAAATGGTACCGAGATGGTTCCATCCATCCCTTAGATGCTGAACTCATACCGGGTAAGACTCGCTTCGAGCCAACGTTATCAATACTGACGTTGGAACCAAATAGAGACGACAATGGAGCGACATTCCGCTGTGTCGTGAGGAACAGGGCCATGAAGGAGGGGAACCAGCTCGAGGCCACTGTTGAACTTAGTGTTAACTATTTCCCCCGCGTTGAAGTCGGTCCCGAAAATCCTTTGCGAGTCGAGATCGATGGGAACGCTAACATGGTGTGCAAGGTTGACGCCAAACCTAAAGTAAACACAGTCAGGTGGACGCGAGATGGGAAATATATATCAAACTCGTTCACACATCTGATACAAGGAGTGACCGTACAGGACGCGGGCAAATACATCTGCAGCGCTGACAACGGACTCGGACAGCCCGGAGAAAATGAGATATACTTGGACGTCCTGTATCCTCCCAGCGTCACAGTCGATTCGAAAACATACGAAGCCGAAGAAGGTGGCAACGTTGAAATAAGATGCGAAGTCTCCTCCAACCCCGACCCTATATCCATCGAATGGACCAAGGAAGGACGAACTGATTTCAGACAGAAAGGGAATACGTTAGTACTGACTCGAGTCGACGCCGACATGGCCGGAACTTACGTGTGTCGGGCCGTGAACGTGATCGCTACCAGTAGTGGTAATAAAGTGGAGAAAGCCGGCATGGCGTCCGTGGCTGTGCTCGTGAGACATAAGCCCGGACGAGCCTACATAAGTCCAGACCATCCGATAGCCCAAGAAGGTACCGGGGTTACATTGACCTGCAGCGCCAAGCCCCCCGGCTGGCCTGTACCGCAATATCGCTGGTTCCGCGAGATAGAAACGGTTGACATCAAACCTACGGTGTTGGCGACGGGGAACAAGTATACAATACCCAGCGCTCACCTGGGTAGCGAAGGCGTGTATCATTGTCAAGCTACGAACGAGCTCGGTCACAGCGAGCTGGCCACCGTCAACCTGGAAGTGTATCAGCCGCCTAGATTCCAGTCAAAGCTGCAACCTCACATGATAAAAAACTCAGGTGAAAGGAACTTCTCTCTCACATGCGTCGCTCTGGGCAAACCTCTCCCTAGCGTCAAATGGTACAAGGACAACTCGGAGATCCGACCCGACGCCAACATGTACGAGGTGAAGACCGAAATTAATAAGAGCAGCAACGCCGTCTACAACATACAGAGCGTTCTCAGATTCCACGGAAGGGCGAGGCCCAGCGGCGACGACCTGCTTCCTGCTGATAGAGGTGTCTATTCTTGCTCCTTTGAGAACGAAGTCAAGAAAGTGGAATCGTCGATGCAACTACGTATTGAACATGAGCCGTTATCGATCAAACAACAGCGGAAGGTGGCGTACGATTTGATGGAGAACGCTGAGATATCGTGTCGAGTGTTGGCGTACCCCAAACCCGAGTTCCAGTGGTACTACAGCATGAACCCCTCCCCGCTGCAGATGTCCTCGGACGGCCATTACGAGATTATAACGACGACCGACGAAAACGACTCGTACTTGAGCGTTCTCATCATTAGAAATATAAAGTCACAGGACTACGGCGACTACTACTGCAACGTTAAGAACACGCTGGGCGGAATACGGCCGCAGATCAGGTTACAGCCCAAGGGCGCGCCCGAACCGCCCAAGAACCTGTCTAGTCAGAAGGTTGACGCGACCTACGTCACCTTGAAGTGGGAGGAGGGTTTCAACGGCGGTCTGTCCAGTACGAAGTACTTCGTACTGTATCGAAGAGTCAGATCCATCAATGGCGAGCCGTGCGCGGTGCAGGGCGCCGACGAGTTCGACTGGAAGGAGTACGATTGCGGCCGGGCGAACCCCTGCAACGTCACCAGGCTAGAGCAACACAACTCCTACTACTTTAAGGTGAAGGCGGTCAATACTAAGGGTCAGAGTAACTATTCCAATGAAATTTCGGTGACGACGAGGGTTGATAAGATATTACCGCCGGAACAGGTATCTTATGACCCCAGGTCTAGTGTCGTGGGCTTCAGGGTGGGACCTACATGTCTCCCGCTTATGGGAGTCATCGAAAATTTGGTTGCCGATGGATGGAAGGTGATAGAAACTATGCCTCTTCGTCTGTCGGGAGTCGTGTCATCGGAACAGGATACGACATTGGATCAAGTGACCGTCGGCGGCCGAGGCGAGAGGAACTCTTACGACCCTAACATACGACTCAGGCTGAAACTGTGCCTACAGAACAACCAAAACGTCTGCAGCGAGTATGTTGAAGCTAAGATCGGTCCATCGATCACCAAAGAAGCGGTCGCCCTCACCACCGGCACCATGATAGCGCTGATCATATCCTGCGTGCTGATCGTCATGGGTTTCATACTCTTCGGCTTGTACTGTCGATGTAAATGTAAGGAGAAAAACAAGGGCTCGTCCAAAAATTATGTCGTAGAGGCCAAACGATCGCCCGTCGACTCGCCCAGGAACCATCCTCCCCCGCCCTACTACCCCACCACCGGCATGGAAAACAAGGCTCTGGAGAGTTCCATGGACGTGCCATCGATCATGGAAGACTCGAAGTACTCCTCGCAACCATACGGCTACCACATGCCCGCTCAGGACATACCGCCCACAGACTGGAACATCCAGTATCTAGAGAACAATTACGCCAACAGCAACAACGGCGGCAGTGTCAACTCCCAGGACTCGCTGTGGCAGCTGAAGATGGTCGCCGCCAACAACTCCTCGGGCATGTGTCACCCCATCATGACCTCCGACAGGCAGAGCAACTATGGCTACGACCCGATCAGACACGGCGGCTACGGCACCATCGATGACTACGCGCCCTATCCGCCGCTGCCGCTGGCACCGCACGGCCAGCACGGGCAGCTCGGCCAGCACGGCCAGCACGGCCAGCTGGCACCCCACGGCCAACACGCGCCACTAGCGCCGCACTCGTCCCACGGCCCGGGCTCGGATTACGCTCGCAACTCCCAGAACCCATCCAGACAAGACTACTGCTCGGACCCCTACGCCTCCGTTCACAAGCCCAAGAAACGGATGGATCAACATATCGAGTCCCCGTACCACGAAGTGAGCGGTCTGCCGGAGTTCCCCGAGGCGGCGGACGACAAGCCGGCGCTGTCCCTCAGCTACGACGAGTCCCTGGAGTCCGGGTACTCCACACCCAACTCACGCGCCCGCCGGGTCATCAGAGAGATCATCGTGTGA

Protein sequence:

>DPOGS208547-PA
MIKRHVVRSPRAGCACDDSLTRPHVSLNPLLVDYVILTILCEDEQHDSREGDDVSMQCRFDLEPVAGESLTYYWVRSTSSGHDNVAIGNIPLETNYQINYAPSEGKFDLLVSNVTYERDNGRFECRVKAGGSGRTLHSQGHSLVVLTQPQPPILNPGPQAQAQEGRQLNLTCSSTGGSPSPYIKWYRDGSIHPLDAELIPGKTRFEPTLSILTLEPNRDDNGATFRCVVRNRAMKEGNQLEATVELSVNYFPRVEVGPENPLRVEIDGNANMVCKVDAKPKVNTVRWTRDGKYISNSFTHLIQGVTVQDAGKYICSADNGLGQPGENEIYLDVLYPPSVTVDSKTYEAEEGGNVEIRCEVSSNPDPISIEWTKEGRTDFRQKGNTLVLTRVDADMAGTYVCRAVNVIATSSGNKVEKAGMASVAVLVRHKPGRAYISPDHPIAQEGTGVTLTCSAKPPGWPVPQYRWFREIETVDIKPTVLATGNKYTIPSAHLGSEGVYHCQATNELGHSELATVNLEVYQPPRFQSKLQPHMIKNSGERNFSLTCVALGKPLPSVKWYKDNSEIRPDANMYEVKTEINKSSNAVYNIQSVLRFHGRARPSGDDLLPADRGVYSCSFENEVKKVESSMQLRIEHEPLSIKQQRKVAYDLMENAEISCRVLAYPKPEFQWYYSMNPSPLQMSSDGHYEIITTTDENDSYLSVLIIRNIKSQDYGDYYCNVKNTLGGIRPQIRLQPKGAPEPPKNLSSQKVDATYVTLKWEEGFNGGLSSTKYFVLYRRVRSINGEPCAVQGADEFDWKEYDCGRANPCNVTRLEQHNSYYFKVKAVNTKGQSNYSNEISVTTRVDKILPPEQVSYDPRSSVVGFRVGPTCLPLMGVIENLVADGWKVIETMPLRLSGVVSSEQDTTLDQVTVGGRGERNSYDPNIRLRLKLCLQNNQNVCSEYVEAKIGPSITKEAVALTTGTMIALIISCVLIVMGFILFGLYCRCKCKEKNKGSSKNYVVEAKRSPVDSPRNHPPPPYYPTTGMENKALESSMDVPSIMEDSKYSSQPYGYHMPAQDIPPTDWNIQYLENNYANSNNGGSVNSQDSLWQLKMVAANNSSGMCHPIMTSDRQSNYGYDPIRHGGYGTIDDYAPYPPLPLAPHGQHGQLGQHGQHGQLAPHGQHAPLAPHSSHGPGSDYARNSQNPSRQDYCSDPYASVHKPKKRMDQHIESPYHEVSGLPEFPEAADDKPALSLSYDESLESGYSTPNSRARRVIREIIV-