Philipp Hagemeister
deefc05b88
Document formats (for #980 )
11 years ago
Jaime Marquínez Ferrándiz
0d75ae2ce3
Fix detection of the webpage charset if it's declared using ' instead of "
Like in "<meta charset='utf-8'/>"
11 years ago
Philipp Hagemeister
f143d86ad2
[sohu] Handle encoding, and fix tests
11 years ago
Philipp Hagemeister
79cb25776f
Cache suitable regular expressions
This speeds up TestAllURLsMatching.test_no_duplicates by about 8000% at the cost of minimal memory overhead.
11 years ago
Jaime Marquínez Ferrándiz
5d51a883c2
Use a dictionary for storing the subtitles
The errors while getting the subtitles are reported as warnings, if no subtitles are found return and empty dict.
11 years ago
Philipp Hagemeister
f38de77f6e
Use unescapeHTML for OpenGraph properties
These are attribute values, so we don't need the more complex and whitespace-destroying cleanHTML - we just need to unescape quotes, that's it.
11 years ago
Philipp Hagemeister
b9d3e1635f
Strip hash info from URL when making requests ( Fixes #1038 )
11 years ago
Philipp Hagemeister
3c4e6d8337
Improve OpenGraph property matching
11 years ago
Jaime Marquínez Ferrándiz
44dbe89035
Use re.DOTALL by default when searching OpenGraph properties
11 years ago
Jaime Marquínez Ferrándiz
46720279c2
InfoExtractor: add some helper methods to extract OpenGraph info
11 years ago
Philipp Hagemeister
690e872c51
Remove video_result helper method
Calling it was more complex then actually including the type in the video info
11 years ago
Jaime Marquínez Ferrándiz
56c7366547
YoutubeIE: reuse instances of InfoExtractors ( closes #998 )
When a IE is added to the list, it's also added to a dictionary. When a IE is requested it first looks in the dictionary and if there's no instance it will create a new one.
That way _real_initialize is only called once for each IE, saving time if it needs to login for example.
11 years ago
Philipp Hagemeister
73e79f2a1b
[3sat] Add support ( Fixes #1001 )
11 years ago
Jaime Marquínez Ferrándiz
fc79158de2
VimeoIE: authentication support ( closes #885 ) and add a method in the base InfoExtractor to get the login info
11 years ago
Philipp Hagemeister
0f81866329
Add --list-extractor-descriptions (human-readable list of IEs)
11 years ago
Philipp Hagemeister
f3d294617f
Document view_count ( Closes #963 )
11 years ago
Filippo Valsorda
98bcd2834a
improve generic and encrypted signature error messages
11 years ago
Philipp Hagemeister
3c25b9abae
Remove useless headers
11 years ago
Philipp Hagemeister
d6983cb460
Fix generic class move (add all files)
11 years ago