{"id":4597,"date":"2018-12-06T12:03:06","date_gmt":"2018-12-06T12:03:06","guid":{"rendered":"https:\/\/max-drake.cc\/?p=4597"},"modified":"2018-12-07T07:42:43","modified_gmt":"2018-12-07T07:42:43","slug":"finding-leads-for-my-services-website-web-scraping-targeting-business-urls-part-3","status":"publish","type":"post","link":"https:\/\/max-drake.cc\/?p=4597","title":{"rendered":"Finding leads for my Services Website- web scraping?  Targeting business URL&#8217;s-Part 3"},"content":{"rendered":"<p>At the end of the<a href=\"https:\/\/max-drake.cc\/2018\/12\/05\/finding-leads-for-my-services-website-web-scraping-part-2\/\" target=\"_blank\" rel=\"noopener\"><strong> last post<\/strong> <\/a>I put a few steps for my process:<\/p>\n<p><strong>Part 1<\/strong>.Initially broad targeting. Harvesting personal emails with searches which include gmail.com etc.<\/p>\n<p><strong>Part 2<\/strong>. I need to find some Facility Management Directories and do a web crawl on some of those.<\/p>\n<p>To do that I could use the Firefox <strong><a href=\"https:\/\/addons.mozilla.org\/en-US\/firefox\/user\/14008366\/\" target=\"_blank\" rel=\"noopener\">add-in Web Scraper<\/a><\/strong> but it is slow and while it is running it locks up my browser.<\/p>\n<p>Instead I used Outwit Hub (Free version) that you can find on their site <a href=\"https:\/\/www.outwit.com\/\" target=\"_blank\" rel=\"noopener\"><strong>Outwit.com<\/strong><\/a>. I had a look at this on <a href=\"https:\/\/max-drake.cc\/2018\/06\/06\/web-searching-scraping-free-tools-free-extract-tables-tool-from-pdfs\/\" target=\"_blank\" rel=\"noopener\"><strong>my post here<\/strong><\/a>.<\/p>\n<p>While looking for this link I see that there is also an Email Sourcer (free version) that you can download. 
I have just done so and will try it shortly.<\/p>\n<p>I use the Outwit scraper as it allows me to continue to use my preferred Firefox browser while the scrape is proceeding in the background.<\/p>\n<p>The video below was interesting because the process it shows is quite simple. The main tool he uses for checking emails is now a paid service.<\/p>\n<p>https:\/\/youtu.be\/wVcs7jaf8v4<\/p>\n<p>The interesting takeaways from the video in my opinion were:<\/p>\n<ul>\n<li>Free<strong> <a href=\"https:\/\/www.seoweather.com\/free-seo-tools\/\" target=\"_blank\" rel=\"noopener\">SEO Tools by SEO Weather<\/a><\/strong> for shortening URLs<\/li>\n<li><a href=\"http:\/\/www.googleguide.com\/advanced_operators_reference.html\" target=\"_blank\" rel=\"noopener\"><strong>Google Guide for advanced search operators<\/strong><\/a>.<\/li>\n<\/ul>\n<p>I initially used Outwit to search some Google Search pages like &#8220;Facility Management Companies US&#8221; and then set up a scraper to take each company&#8217;s URL and text\u00a0 info from the URLs listed in the Google search. 
As I&#8217;d set Google search results to the maximum of 100 per page, it would harvest 100 rows of data that could then be viewed.<\/p>\n<p><strong>Open Outwit Hub Light and type in Google URL search.<\/strong><\/p>\n<p><img decoding=\"async\" class=\"wp-image-4602 aligncenter lazyload\" data-src=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im02-300x174.jpg\" alt=\"\" width=\"1448\" height=\"840\" data-srcset=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im02-300x174.jpg 300w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im02-768x445.jpg 768w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im02-1024x594.jpg 1024w\" data-sizes=\"(max-width: 1448px) 100vw, 1448px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1448px; --smush-placeholder-aspect-ratio: 1448\/840;\" \/><\/p>\n<p><strong>Use browser to find Markers<\/strong><\/p>\n<p>View page source and find the items that you want. You want to scrape data that is structured the same way on each search page. Outwit Hub Light needs the marker before and the marker after the data that you want to pull. 
Best to use a Browser and use Right click &#8220;inspect element&#8221; and it will show you the Marker before and the marker after that you can cut\/paste into Outwit scraper that you are setting up<\/p>\n<p><img decoding=\"async\" class=\"alignnone wp-image-4605 lazyload\" data-src=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im3b-300x168.jpg\" alt=\"\" width=\"1691\" height=\"947\" data-srcset=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im3b-300x168.jpg 300w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im3b-768x430.jpg 768w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im3b-1024x573.jpg 1024w\" data-sizes=\"(max-width: 1691px) 100vw, 1691px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1691px; --smush-placeholder-aspect-ratio: 1691\/947;\" \/><\/p>\n<p><strong>Put Marker codes into Scraper in Outwit<\/strong><\/p>\n<p><img decoding=\"async\" class=\"wp-image-4603 aligncenter lazyload\" data-src=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im03-300x105.jpg\" alt=\"\" width=\"1497\" height=\"524\" data-srcset=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im03-300x105.jpg 300w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im03-768x268.jpg 768w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im03-1024x357.jpg 1024w\" data-sizes=\"(max-width: 1497px) 100vw, 1497px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1497px; --smush-placeholder-aspect-ratio: 1497\/524;\" \/><\/p>\n<p><strong>Scrape pages using Outwit and export<\/strong><\/p>\n<p>Then go to the google search page (if more than one, select first) then press the scraper button and execute and export to a CSV file. 
After that, change the Google search to the next page and repeat, carrying on until all the pages are viewed, scraped &amp; extracted to files. Then join all the files together.<\/p>\n<p><img decoding=\"async\" class=\"wp-image-4604 aligncenter lazyload\" data-src=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im03a-300x132.jpg\" alt=\"\" width=\"1468\" height=\"646\" data-srcset=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im03a-300x132.jpg 300w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im03a-768x338.jpg 768w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im03a-1024x451.jpg 1024w\" data-sizes=\"(max-width: 1468px) 100vw, 1468px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1468px; --smush-placeholder-aspect-ratio: 1468\/646;\" \/><\/p>\n<p><strong>Excel page with all the exports.<\/strong><\/p>\n<p>I then take each company URL and check it in Chrome to see what emails occur.<\/p>\n<p><img decoding=\"async\" class=\"wp-image-4601 aligncenter lazyload\" data-src=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im04-300x96.jpg\" alt=\"\" width=\"1694\" height=\"542\" data-srcset=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im04-300x96.jpg 300w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im04-768x246.jpg 768w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im04-1024x328.jpg 1024w\" data-sizes=\"(max-width: 1694px) 100vw, 1694px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1694px; --smush-placeholder-aspect-ratio: 1694\/542;\" \/><\/p>\n<p><strong>Chrome using the Email Extractor plug-in<\/strong><\/p>\n<p>Using advanced search\u00a0\u00a0 <em>&#8220;allinurl:\u00a0\u00a0 xxxxxxx.com.au&#8221;\u00a0<\/em> I look to see what emails 
are harvested &amp; save them to a separate file.<\/p>\n<p><img decoding=\"async\" class=\"wp-image-4600 aligncenter lazyload\" data-src=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im05-300x140.jpg\" alt=\"\" width=\"1500\" height=\"700\" data-srcset=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im05-300x140.jpg 300w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im05-1024x479.jpg 1024w\" data-sizes=\"(max-width: 1500px) 100vw, 1500px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1500px; --smush-placeholder-aspect-ratio: 1500\/700;\" \/><\/p>\n<p>Unfortunately, a lot of the emails are not viable, and nearly all the ones with names in were the erroneous ones. Those were the more specific ones I was interested in. So I decided to check a few of the Failed ones.<\/p>\n<p><img decoding=\"async\" class=\"wp-image-4606 aligncenter lazyload\" data-src=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im06-300x216.jpg\" alt=\"\" width=\"1611\" height=\"1160\" data-srcset=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im06-300x216.jpg 300w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im06-768x553.jpg 768w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im06-1024x737.jpg 1024w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im06.jpg 1459w\" data-sizes=\"(max-width: 1611px) 100vw, 1611px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1611px; --smush-placeholder-aspect-ratio: 1611\/1160;\" \/><\/p>\n<p><strong>Validating the validator<\/strong><\/p>\n<p>I looked for some other email validator\u00a0 and found <a href=\"https:\/\/www.usethistip.com\/3-websites-to-check-if-email-address-is.html\" target=\"_blank\" rel=\"noopener\"><strong>this post<\/strong><\/a>. 
I tried the first option, <a href=\"https:\/\/verify-email.org\/\" target=\"_blank\" rel=\"noopener\"><strong>Verify Email.org<\/strong><\/a>. It only checks one email at a time, but it agreed with what Dr Email Verifier said. Then I tried <a href=\"http:\/\/www.mailtester.com\/testmail.php\" target=\"_blank\" rel=\"noopener\"><strong>Mailtester<\/strong><\/a>. The online site also tests one email at a time, but there was a trial download version that allowed you to check 20 emails, and the results were:<\/p>\n<p><img decoding=\"async\" class=\"wp-image-4607 aligncenter lazyload\" data-src=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im07-300x99.jpg\" alt=\"\" width=\"1445\" height=\"477\" data-srcset=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im07-300x99.jpg 300w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im07-768x253.jpg 768w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im07-1024x338.jpg 1024w\" data-sizes=\"(max-width: 1445px) 100vw, 1445px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1445px; --smush-placeholder-aspect-ratio: 1445\/477;\" \/><\/p>\n<p>So it looks like Dr Email Verifier is doing an OK job.<\/p>\n<p>Of the 119 emails I harvested with this process yesterday, only 31 were valid, a success rate of about 26%, and a lot of those were generic ones like &#8220;sales@&#8230;.&#8221; or &#8220;info@..&#8221; etc. Only 9 of the valid ones belonged to named people, so about a 7.5% success rate from that harvesting method. What I&#8217;m doing in the above process is not that effective. Actually, there were a couple of duds so only 7 went, so down to about 6%.<\/p>\n<p>Because there were so few I could easily split out the first name into a separate column for the salutation. Then, as it came from an email address, it was all in lower case. 
To make the first letter upper case you use the formula\u00a0\u00a0 =PROPER(A2)\u00a0 in Excel. You then copy the column with the upper case letter at the front and paste VALUES onto the original lower case column.<\/p>\n<p>It&#8217;s also not an elegant workflow: I&#8217;m hopping between programmes and Excel sheets to get the work done. I&#8217;ll think about seeing if I could design a better scraper that would target emails. I&#8217;ll have to look into how to code that. I would most probably have to go on to a company&#8217;s website and look for the email there. The markers may be different on each specific company&#8217;s page, although the preceding one could be something like &#8220;mailto&#8221;.<\/p>\n<p>&nbsp;<\/p>\n<h3>Email Sourcer light is a DUD<\/h3>\n<p>It runs nicely and crawls links to find emails, then it overwrites the emails in the free version so they are unusable. The pro version costs $69 US. Unfortunately it does not tell you that the free version is useless until you download and try it. 
A bit sneaky, not a marketing method I approve of, but the OutWit free web scraper is nice.\u00a0 That&#8217;s an hour gone wasting my time with that.<\/p>\n<p><img decoding=\"async\" class=\"wp-image-4610 aligncenter lazyload\" data-src=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im08-300x183.jpg\" alt=\"\" width=\"1493\" height=\"911\" data-srcset=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im08-300x183.jpg 300w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im08-768x469.jpg 768w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im08-1024x625.jpg 1024w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im08.jpg 1762w\" data-sizes=\"(max-width: 1493px) 100vw, 1493px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1493px; --smush-placeholder-aspect-ratio: 1493\/911;\" \/><\/p>\n<p><img decoding=\"async\" class=\"wp-image-4609 aligncenter lazyload\" data-src=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im09-300x137.jpg\" alt=\"\" width=\"1535\" height=\"701\" data-srcset=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im09-300x137.jpg 300w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im09-768x351.jpg 768w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im09-1024x468.jpg 1024w\" data-sizes=\"(max-width: 1535px) 100vw, 1535px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1535px; --smush-placeholder-aspect-ratio: 1535\/701;\" \/><\/p>\n<p><img decoding=\"async\" class=\"wp-image-4608 aligncenter lazyload\" data-src=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im10-300x175.jpg\" alt=\"\" width=\"1524\" height=\"889\" data-srcset=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im10-300x175.jpg 300w, 
https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im10-768x447.jpg 768w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im10-1024x596.jpg 1024w\" data-sizes=\"(max-width: 1524px) 100vw, 1524px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1524px; --smush-placeholder-aspect-ratio: 1524\/889;\" \/><\/p>\n<h3>Banners on other sites.<\/h3>\n<p>One resource I had not got around to using was my existing websites, to tell visitors about the new site.<\/p>\n<p>I used a simple header banner on my website when I was doing a MailChimp marketing exercise offering free lessons. I quite like the header banner; it is discreet and doesn&#8217;t annoy as much as pop-up and dynamic banners. I used the <a href=\"https:\/\/wordpress.org\/plugins\/simple-banner\/\" target=\"_blank\" rel=\"noopener\"><strong>simple banner plugin<\/strong><\/a> for the site. One of the issues I have had in the past is getting the background colour to match other colours on the site. I have usually fiddled away with it and finally accepted a\u00a0 &#8216;near enough&#8217; solution. This time I used an online colour picker, so easy, why didn&#8217;t I do this before? The site I used was <a href=\"https:\/\/imagecolorpicker.com\/\" target=\"_blank\" rel=\"noopener\"><strong>Color Picker Online<\/strong><\/a>, which allows you to either use an image or link to a URL. I tried the first method and got a great match. Then I checked with the 2nd method and the colours selected were slightly different. Definitely a great tool, this did save me a great deal of time.<\/p>\n<p>Anyway, after getting the banner as I wanted it on one site, I used the same setup for my other sites. Hopefully this will encourage visitors to one site to look at the other. 
Generally they are people who are interested in the topic so maybe this will get a bit of traffic to the site.<\/p>\n<h3>End thoughts<\/h3>\n<p>A day&#8217;s work for 7 email addresses!! Looked at from that perspective, it was not an efficient use of time. This will need some mulling over.<\/p>\n<p>There are a lot of paid email scrapers, e.g. the Outwit one, for which a year&#8217;s license is $45 US. In terms of investing time and energy into this, maybe a tool like that is useful. The emails will still need to be validated. Maybe that is the method to use. But going through the process gives you an understanding of what needs to be considered in the whole procedure. Also, in looking at the process, maybe I am losing sight of the end objective, which is to drive viewers to the site, specifically viewers who are interested in the services and some who may consider using some of the services.<\/p>\n<p>At this point in time I am cash poor but have time to explore solutions. I am learning a lot. At present though I&#8217;m not being that effective.<\/p>\n<p>Here are the metrics of the site. 
Very sad \ud83d\ude41<\/p>\n<p><img decoding=\"async\" class=\"wp-image-4611 aligncenter lazyload\" data-src=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im11-300x189.jpg\" alt=\"\" width=\"1440\" height=\"907\" data-srcset=\"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im11-300x189.jpg 300w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im11-768x483.jpg 768w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im11-1024x644.jpg 1024w, https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im11-200x125.jpg 200w\" data-sizes=\"(max-width: 1440px) 100vw, 1440px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1440px; --smush-placeholder-aspect-ratio: 1440\/907;\" \/><\/p>\n<p>Still, as my father used to tell me, <em>&#8220;there is only one way to go from here&#8221;. <\/em>We&#8217;ll see what comes next.<\/p>\n<p>Things that I&#8217;ve picked up that are useful (mainly in this part of the process as they are fresh in my memory):<\/p>\n<ul>\n<li>Free<strong> <a href=\"https:\/\/www.seoweather.com\/free-seo-tools\/\" target=\"_blank\" rel=\"noopener\">SEO Tools by SEO Weather<\/a><\/strong> for shortening URLs. The URL trimmer. This is a quick tool and I think will come in handy. Especially with the Outwit crawls.<\/li>\n<li><a href=\"http:\/\/www.googleguide.com\/advanced_operators_reference.html\" target=\"_blank\" rel=\"noopener\"><strong>Google Guide for advanced search operators<\/strong><\/a>. This will hopefully extract more from fewer queries. I can only practice to see if they are effective.<\/li>\n<li><a href=\"https:\/\/www.outwit.com\/\" target=\"_blank\" rel=\"noopener\"><strong>Outwit.com<\/strong><\/a> Outwit hub light. Definitely useful.<\/li>\n<li>Dr Email Verifier. 
Seems to be accurate and works on most of the company email addresses (so good apart from unsupported servers like Yahoo), and this was checked with other validation tools.<\/li>\n<li><a href=\"https:\/\/imagecolorpicker.com\/\" target=\"_blank\" rel=\"noopener\"><strong>Color Picker Online<\/strong><\/a> was a good find for web stuff.<\/li>\n<li>The banner on my existing websites. Hopefully that will get some people to explore the DataIKnoW site.<\/li>\n<li>Thunderbird Mail Merge<\/li>\n<\/ul>\n<p>Not so useful:<\/p>\n<ul>\n<li>Outwit Email sourcer light. Of no use. But the paid version looks as if it will be useful.<\/li>\n<li>I don&#8217;t think the untargeted mailout is working so far. 58 + 124 = 182 emails and no response. I do need to have another look at the template. I think it needs a couple of re-writes.<\/li>\n<\/ul>\n<p>Another thing I think is relevant to this process is getting things out there working. As this is a passive site things need to percolate down and gain traction. That is my understanding of the process so far, so by sending stuff into hyperspace something has been tried, so you can move onto the next idea. More attempts at different things will increase awareness of the site generally. One video I saw said 3-6 months for some sort of traction.<\/p>\n<p>I seem to have that traction regarding OpenMaint on my sites as they come up quite high in the rankings on &#8220;OpenMaint Setup&#8221; and &#8220;OpenMaint Configuration&#8221;. So I suppose time is a factor in the process.<\/p>\n<p>So the journey continues, onwards\u00a0 and upwards.<\/p>\n<p>I think I need to revisit this part 2 of the process of finding company URLs to scrape to find email addresses. That&#8217;s another post.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>At the end of the last post I put a few steps for my process: Part 1.Initially broad targeting. Harvesting personal emails with searches which include gmail.com etc. Part 2. 
I need to find some Facility Management Directories and do a web crawl on some of those. To do that I could use the Firefox [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":4602,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[40,3,42,29],"tags":[],"class_list":["post-4597","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-analysis","category-data-extraction","category-productivity","category-web"],"featured_image_src":"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im02.jpg","featured_image_src_square":"https:\/\/max-drake.cc\/wp-content\/uploads\/2018\/12\/im02.jpg","author_info":{"display_name":"Max Drake","author_link":"https:\/\/max-drake.cc\/?author=1"},"_links":{"self":[{"href":"https:\/\/max-drake.cc\/index.php?rest_route=\/wp\/v2\/posts\/4597","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/max-drake.cc\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/max-drake.cc\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/max-drake.cc\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/max-drake.cc\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=4597"}],"version-history":[{"count":0,"href":"https:\/\/max-drake.cc\/index.php?rest_route=\/wp\/v2\/posts\/4597\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/max-drake.cc\/index.php?rest_route=\/wp\/v2\/media\/4602"}],"wp:attachment":[{"href":"https:\/\/max-drake.cc\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=4597"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/max-drake.cc\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=4597"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/max-drake.cc\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=4597"}],"curies":[{"name":"wp","href":"https:\/\/a
pi.w.org\/{rel}","templated":true}]}}