Browse Source

Added Selenium

pull/45/head
Jure Šorn 5 years ago
parent
commit
69ee03791a
2 changed files with 25 additions and 0 deletions
  1. 14
      README.md
  2. 11
      index.html

14
README.md

@ -1808,6 +1808,20 @@ Scraping
('https://www.python.org/', '3.7.2')
```
### Selenium
**Library for scraping dynamically generated web content.**
```python
# $ brew cask install chromedriver
# $ pip3 install selenium
>>> from selenium import webdriver
>>> driver = webdriver.Chrome()
>>> driver.get(url)
>>> xpath = '//*[@id="mw-content-text"]/div/table[1]/tbody/tr[7]/td/div'
>>> driver.find_element_by_xpath(xpath).text.split()[0]
'3.7.2'
```
Web
---

11
index.html

@ -1483,6 +1483,17 @@ logger.&lt;level&gt;(<span class="hljs-string">'A logging message'</span>)
<span class="hljs-meta">&gt;&gt;&gt; </span>link, ver
(<span class="hljs-string">'https://www.python.org/'</span>, <span class="hljs-string">'3.7.2'</span>)
</code></pre>
<h3 id="selenium">Selenium</h3>
<p><strong>Library for scraping dynamically generated web content.</strong></p>
<pre><code class="python language-python hljs"><span class="hljs-comment"># $ brew cask install chromedriver</span>
<span class="hljs-comment"># $ pip3 install selenium</span>
<span class="hljs-meta">&gt;&gt;&gt; </span><span class="hljs-keyword">from</span> selenium <span class="hljs-keyword">import</span> webdriver
<span class="hljs-meta">&gt;&gt;&gt; </span>driver = webdriver.Chrome()
<span class="hljs-meta">&gt;&gt;&gt; </span>driver.get(url)
<span class="hljs-meta">&gt;&gt;&gt; </span>xpath = <span class="hljs-string">'//*[@id="mw-content-text"]/div/table[1]/tbody/tr[7]/td/div'</span>
<span class="hljs-meta">&gt;&gt;&gt; </span>driver.find_element_by_xpath(xpath).text.split()[<span class="hljs-number">0</span>]
<span class="hljs-string">'3.7.2'</span>
</code></pre>
<h2 id="web"><a href="#web" name="web">#</a>Web</h2>
<pre><code class="python language-python hljs"><span class="hljs-comment"># $ pip3 install bottle</span>
<span class="hljs-keyword">from</span> bottle <span class="hljs-keyword">import</span> run, route, post, template, request, response

Loading…
Cancel
Save