Monarch geneset OGS2.0

DPOGS216174
TranscriptDPOGS216174-TA5232 bp
ProteinDPOGS216174-PA1743 aa
Genomic positionDPSCF300155 + 303768-321343
RNAseq coverage1723x (Rank: top 7%)
Annotation
HeliconiusHMEL0165580.051.11% 
BombyxBGIBMGA014161-TA5e-16050.00% 
Drosophilayl-PB8e-17928.11% 
EBI UniRef50UniRef50_E2FLQ40.047.83%Vitellogenin receptor n=3 Tax=Bombyx mori RepID=E2FLQ4_BOMMO
NCBI RefSeqNP_001184180.10.047.83%vitellogenin receptor [Bombyx mori]
NCBI nr blastpgi|3087369740.047.83%vitellogenin receptor precursor [Bombyx mori]
NCBI nr blastxgi|3087369740.047.69%vitellogenin receptor precursor [Bombyx mori]
Group
Gene OntologyGO:00055154.1e-14protein binding
KEGG pathwaycfa:4787818e-103 
 K06233 (LRP2)maps-> Hedgehog signaling pathway
InterPro domain[300-567] IPR0110421.2e-53Six-bladed beta-propeller, TolB-like
[73-112] IPR0021724.1e-14Low-density lipoprotein (LDL) receptor class A repeat
[203-932] IPR0090304e-07Growth factor, receptor
[365-407] IPR0000336.7e-06LDLR class B repeat
Orthology groupMCL15409 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216174-TA
ATGGCAGTTCTCCTGTTATTATTTTCGTTGCTGGCAGTCTGTTCCGCGCAGTTTAAGGATGACTTGCAGACTTATGAGACGGAGTGTCTCATGGAAGGTTCATTTTCTTGTACATCAGGAGCGTGTATACCGTCGGAGAAGTATTGCGATGGGTATAACGACTGTGAGGATGGAGGCGATGAGAACTTTTGTGCCAATCACCGTCCAGATGCTCACTTGTGTAACGAGACTCACCAGTTTCTATGTACGGACGGGTTGATGTGTCTACCGAGCTCTTGGGTGTGCAACTATGAGAACGACTGCAAAGATGGTTCCGATGAAATGGGCTGTGAAAAGATAATCCAACATGATAATTCATCATGCAAAGGTTTCCAATGCGACGGCGGGAAGCTGTGCATATCAGATCTTTGGATGTGTGACGGTTATTACGACTGTGCTGATAAAACAGATGAAGACGTCGTGGACACTTGTCATCATGCACCTCGTCCGAAGCATCTTCATGATACACTGAACTGCGAGGTCAGATCTATGAACTACACGTGCTTGGATAAATCGTACTGCATACCGTACAATAACATGTGTGACGGTCTGAAGGATTGTAGAGACGGTAGTGATGAAGGGGCATTTTGTGCCGAATGGTCCAAGATGTGTGCAAACAGAACCTGTCCCAGGGAGTCGTTCTGCAAGCCCACTCGTAATGGTGGTACTTGCGTGTGTAACTCCTACAAGGAGTACAATCCCAGCAGTGGGGAGTGCGAGCGTTCCCGACAGTGTCTCCAGGAAGTGCCCGTCTGTTCCCACATGTGTGAGGATATGGGGGACTACTTCAAATGCACTTGTGAAGATGGATACAGATCTGATCATGCTCAGTATCTGTGCTTTGCTCCTGGTCCCGAAGCGATGTTGTTCTTCAGTACCCAAAACTCTATACAGTATGTGACAGTCAAATCGAATCATAGCGTCACCGTGTTGACTGGAATCAAAAAGGCCCACGGTGTTGCATACGATGGCAAATATTTATACTGGGTCGAAACAGAAAAAGGACATCAGGCCATCATGAAGGCACAGCTAGAAGACGTCGCTGGAACCAAACAGGTGCTGGCTGCCCTGGGTCTTGAGGATCCGGGAGATATAGCCGTGGACTATCTCGGTGATAACATCTACTTCAGTGACACAGCACGCGGCTGTATCACTGTGTGTCGGACTGATGGCGCTCTGTGCGTCACGCTCGCCGCACACACGAGGAGACCAAAGTTTGTTACACTGGATCCAAGGAAGGGCGTCATGTATTGGGCTGATAAGCATGACAAGCCGGTTATAATGAAAGCTAAGATGGATGGCTCGGAGTCGGAAAACCTTGTGCACGAGCTGAGTACTTTTGCGAAAGGCCTTGCTTTGGACGCACCGAACGGAAGGTTGTACTTCGTTGATGGAACTATTAAAGTTGTTATACTGAATGATAAGAGAGTTTATTCCTTTTTCGAAGAGCAATTCCACCATCCGTACTCGCTGTCAGTGTTTGAAAACACAATATTCTGGAGTGATTGGACTTCCAGGACGATACAGACAGCAGATAAAATACACGGCACTAGCATCAACAGGAACATTTTGCTGACCCTGGATACGCCGGTGTTTGACATGCACCTCTACCATCCGATCCTCACGAACGCAACCCACAACCCGTGTTCGTCTCGCTCGTGTCCACTCTGCCTTATAACATCCAACACGAGCTCTGTGTGCGCCTGTCCAGATTCTATGCGAAATGTTAAAGGGCATTGCGAATGGATACCCGGCTATCGTCCTGATTACTTGTTGGTAGCATCAGGGTCAGCTTTCATCAGAATATATTATGATACTGTCGGAAATCCCGAGACCCACTCGACTGTCCTCGATATAGGAAGGGTCCAAGCCATGGCCTATGACAATTTTAGAGATACTCTTTATATATACGATGGTCAACATAAGTCTATCAACTCTATCCTTATGAGCGACTTTTCCCTCGGCGTTACTCACCTTTTTATGTACAAAGGTCTTGAAAATGTTGTTGATATGGATTATGATTACGTATCCGATTCTTTATATATCTTGGACGCTGGTCGTCGCGTTATAGAAGTGACTTCACTGAAAAACAAACACACAGCATTACTATATCGATTTAGGGAAGAGGAAATACCCATAAGTTTTTGCGTTTTGTCAGCGTACGGACGTATGTTAGTTGCTGTCTTGGATACCGATAACGATATCATCTACATAGATTCATTCGGACTCGATGGTGACGATCGTAAACACATAGTTACAAATAATACACGTGGTCCAAATATCCGGATGAGGTACTCAACCGATCTGGATGTAGTATTTTTATCTGATGATCAAAACGGAGTAATTGACTTTCTACATCCACAAGGTACTGGTAGAGAGAATTTCCGTGAAGTTGTAACAAATATCGCTAGTCTCGCTGTGACCGACAATCAAGTGTTCTGGACGGACAAACGGTCGACCAAATTGTATTGGGCGGATATGCATGATGCGACGAGGAAGATACGCAGAATAGAATTATCAATATTTCCAAACACCACACATCTGGTTATTCTCGCTACTTCACCCCTTCCTAAATCACATGGTCACACATGCTCCAACTCGAACCCCCCTTGCTCCCATATTTGTGTGCAAAAATCGCACGAATACGTAAAACCGGGATCGATTGATTCAGCGCCTTTGCAGTACACTTGCTTGTGTCCCGTCGGCTTCATAAATAATAATGGGATTTGTTACGAAGTCACAAAGTGTAAAGAGCACGAGTTTTATTGTCATAAAAGTAATCAATGCTTTGACGGTAGCAAGAAATGCGATGGTACTGAAGATTGCAAATTTGGTGAAGACGAAGAAGGGTGCTTGATTAAAGGAACATATTATTTGAAAATATGTGAAGATGACGAAATCGATTGTCATGGCATTTGTATTCAGAGAACTGAAATCTGTCGGAACAATACCACAAAACAGTTGTTAACGTGCGGTTCCTCTGAATTCCGTTGCTCGGACAACTCTATATGTATAGAGCGTGCGCTCGCGTGTGACGGCCACGCGGACTGTAGAGACGCTTCTGATGAACACCCAGACGCCTGCGATACCAGAGACTGTGGCGAATTTGAATACATGTGTGCTTCAGGATCATGTATACCACTCACGTGGAAATGTGACAAACACGAAGATTGCTTGGACGGCTCCGATGAGATCAGCTGTGAGAGCAGAAGCTGTCCGTCCGGTACCTTTGAATGTGATACTGGATGTGTCGAGGTTTACAAGAGATGTGACGGAAAATACGACTGCGAAGATCACGAGGACGAAAGGGACTGTGATGAACCGGAGTTTGCTGGAGTTATTGACTTCTCGTCGTGCGCTCCGTGGGAGTACAGATGTGAGCACAACAAGTCAATTTGCTTGCCTCAAACAGCACGTTGCAATGGCCGCACAGACTGCCCGGGTGGTTCGGACGAGGCGGGATGTAACTTCCAGTGCGGGGAACTGTTCCCGTGCACTCAGGAACATTGGTGCGTGTACCGAGACCAGCTCTGTGATGGCAGGCAGGACTGCGCTGACGGATCCGACGAGACCTTCGACGTTTGTGCCAGAGCCAATAAAACAAGACCTCTGACACAAACGCCGCCAACACCTTGCGAGGACTACCGCTGTGATGATGGCCAGTGCCTGCCGTGGGAACACGTGTGTGACGACAAGACTCACTGCCGGGACGGCTCCGACGAGAACGGGCTCTGTAACAGTACTTGCGCTGCTGGGTGTATATCTCGGCGTACTCCCCGCGGCCCGCGGTGCTCATGCAAGAGTGTCTGTGAGGAGTGCTCGCACACGTGCCACACTGACGGGGAGACGTCCGTGTGCGCCTGCTATAAGGGATACGGACTTAGATTGGACCGTCGCTCGTGTAAGGCGCTGTATGGGGCTCCGGCCACCTTATACTCCCGGGGGGGTGCAGCGTGGTCATTGACCTCTCACGTTCATACAATGCTATATCACGAGGGGGAGGAACTCAGCGACTTGGATTGCGATGTCAGGCGAGGCAAGTTATATTTTACTTCGAGTGAGGTCGGTAAGCTGATTGAAGTGGACAACAAAAGACAGCCGCCAGTAAAAAGCATAACCAACATAGGGAGACCTGGCAAGTTGTCAGCGGACTGGATAACCGGTAACGTGTACTTCGCGGACAGCACTCCGTCCCAAGGCTCCATACGAGTTTGTAATTTCAAAAAGCAGAAGTGCGCCAAATTACAGAAGATACCCACTGATGTGCAGGTGACAGCCCTAGTTACAGACCCCGCTAATCACCTGTTATTCTACTGTCTATCTGACTCTTCTGAATCTCACATACGCTCGTCCTCGATGTCTGGTCGCTCTCCGTCCGACGTGGCCACGGTTCCAGCTTGTCTGGGCCTCGCTGTGAACTCCTTCAGCAAACTCCTGTATATCAGCACGCCATCTTCTATCATGAAGGTCGGCTATGATGGAAGCAATCTCATAACTCTCATAGACCACCCTATAAGCACACCTCACCTCGCATTCTTCGAGGACTACATATACTTCATACACAATTCCCACGTAACCCGCTGCCTGCACTTCGGCCCCAAAACCTGCGAGACCCTCAGCCACGTGTACAACGCCTCAAACTTCGTCCTAAGTCACGAGAGCATACAAAGAGATGACGTCATCAACTCGTGTGACGTCACGGAGTGCGGACACGTGTGCGTGTTGGATCGTGTAGCTGTGTGCGTGTGTCATGATGGCAGTATAGTTAGAGATGGAGTGTGTCCTGGGGACAGGGCGGAGGAGCAGGCCGTGTTCAGTGACGGCTCCCATTCCCACGTTTGGTCCTTAACATTTCTGTGGGTGCTGTTGCTGATGCTCGCGGTGTACGCGGCCGCGTTTGTCCACTACCGCCTCTACAGGAAAAACAAGACGCCAGCCGAATATATACAAGTCAGATATCACAACACCTCAGAAGGCCTGACTCATCTTTCGCACCCAATCATCGACGTCCCTGAAGCCGGAGCAATGAGCCACGAGTTTGTGAACCCACTGCAATTCGTAAGAAATTTTTGGCGGGAATCCTTCGAAAGACAGAAGCCGATCGGTTCAAATGTCATGTACGAAGAACAGCAGGATCCATCGGACACGGAGTCCGATCTCGATGTGAGAGAAACAAGGAGAATGATCAAATGA

Protein sequence:

>DPOGS216174-PA
MAVLLLLFSLLAVCSAQFKDDLQTYETECLMEGSFSCTSGACIPSEKYCDGYNDCEDGGDENFCANHRPDAHLCNETHQFLCTDGLMCLPSSWVCNYENDCKDGSDEMGCEKIIQHDNSSCKGFQCDGGKLCISDLWMCDGYYDCADKTDEDVVDTCHHAPRPKHLHDTLNCEVRSMNYTCLDKSYCIPYNNMCDGLKDCRDGSDEGAFCAEWSKMCANRTCPRESFCKPTRNGGTCVCNSYKEYNPSSGECERSRQCLQEVPVCSHMCEDMGDYFKCTCEDGYRSDHAQYLCFAPGPEAMLFFSTQNSIQYVTVKSNHSVTVLTGIKKAHGVAYDGKYLYWVETEKGHQAIMKAQLEDVAGTKQVLAALGLEDPGDIAVDYLGDNIYFSDTARGCITVCRTDGALCVTLAAHTRRPKFVTLDPRKGVMYWADKHDKPVIMKAKMDGSESENLVHELSTFAKGLALDAPNGRLYFVDGTIKVVILNDKRVYSFFEEQFHHPYSLSVFENTIFWSDWTSRTIQTADKIHGTSINRNILLTLDTPVFDMHLYHPILTNATHNPCSSRSCPLCLITSNTSSVCACPDSMRNVKGHCEWIPGYRPDYLLVASGSAFIRIYYDTVGNPETHSTVLDIGRVQAMAYDNFRDTLYIYDGQHKSINSILMSDFSLGVTHLFMYKGLENVVDMDYDYVSDSLYILDAGRRVIEVTSLKNKHTALLYRFREEEIPISFCVLSAYGRMLVAVLDTDNDIIYIDSFGLDGDDRKHIVTNNTRGPNIRMRYSTDLDVVFLSDDQNGVIDFLHPQGTGRENFREVVTNIASLAVTDNQVFWTDKRSTKLYWADMHDATRKIRRIELSIFPNTTHLVILATSPLPKSHGHTCSNSNPPCSHICVQKSHEYVKPGSIDSAPLQYTCLCPVGFINNNGICYEVTKCKEHEFYCHKSNQCFDGSKKCDGTEDCKFGEDEEGCLIKGTYYLKICEDDEIDCHGICIQRTEICRNNTTKQLLTCGSSEFRCSDNSICIERALACDGHADCRDASDEHPDACDTRDCGEFEYMCASGSCIPLTWKCDKHEDCLDGSDEISCESRSCPSGTFECDTGCVEVYKRCDGKYDCEDHEDERDCDEPEFAGVIDFSSCAPWEYRCEHNKSICLPQTARCNGRTDCPGGSDEAGCNFQCGELFPCTQEHWCVYRDQLCDGRQDCADGSDETFDVCARANKTRPLTQTPPTPCEDYRCDDGQCLPWEHVCDDKTHCRDGSDENGLCNSTCAAGCISRRTPRGPRCSCKSVCEECSHTCHTDGETSVCACYKGYGLRLDRRSCKALYGAPATLYSRGGAAWSLTSHVHTMLYHEGEELSDLDCDVRRGKLYFTSSEVGKLIEVDNKRQPPVKSITNIGRPGKLSADWITGNVYFADSTPSQGSIRVCNFKKQKCAKLQKIPTDVQVTALVTDPANHLLFYCLSDSSESHIRSSSMSGRSPSDVATVPACLGLAVNSFSKLLYISTPSSIMKVGYDGSNLITLIDHPISTPHLAFFEDYIYFIHNSHVTRCLHFGPKTCETLSHVYNASNFVLSHESIQRDDVINSCDVTECGHVCVLDRVAVCVCHDGSIVRDGVCPGDRAEEQAVFSDGSHSHVWSLTFLWVLLLMLAVYAAAFVHYRLYRKNKTPAEYIQVRYHNTSEGLTHLSHPIIDVPEAGAMSHEFVNPLQFVRNFWRESFERQKPIGSNVMYEEQQDPSDTESDLDVRETRRMIK-