Adam Thalhammer
c587cbb793
improved performance by extracting accented chars to top level
8 years ago
Adam Thalhammer
79a2e94e79
Instead of replacing accented characters with an underscore when sanitizing file names in restricted mode, replace them with their non-accented equivalents (fixes #9347)
8 years ago
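The replacement described in the entry above can be sketched roughly as follows. This is an illustration only, not the project's code: the names `ACCENT_MAP` and `sanitize_restricted` and the short mapping are hypothetical, and the set of characters kept as-is is an assumption.

```python
# Hypothetical, deliberately short mapping; defined once at module level
# so it is not rebuilt on every call.
ACCENT_MAP = {
    '\u00e1': 'a',  # a with acute
    '\u00e9': 'e',  # e with acute
    '\u00ed': 'i',  # i with acute
    '\u00f3': 'o',  # o with acute
    '\u00fa': 'u',  # u with acute
    '\u00f1': 'n',  # n with tilde
    '\u00e7': 'c',  # c with cedilla
}


def sanitize_restricted(name):
    """Return name with accented characters mapped to plain ASCII letters.

    Any other character outside a small safe ASCII set becomes '_'.
    """
    out = []
    for ch in name:
        if ch in ACCENT_MAP:
            out.append(ACCENT_MAP[ch])
        elif ord(ch) < 128 and (ch.isalnum() or ch in '-_.'):
            out.append(ch)
        else:
            out.append('_')
    return ''.join(out)
```

With this sketch, a name containing an accented letter keeps a readable ASCII letter in its place, while characters that are not in the mapping still fall back to an underscore.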
Sergey M․
eb9ee19422
[utils] Allow None mimetypes in mimetype2ext
8 years ago
remitamine
acaff49575
[utils] improve parse_duration to handle more formats
8 years ago
Yen Chi Hsuan
cacd996662
[utils] Don't touch URLs if not necessary
Fix test_Generic_15 (Google redirect)
8 years ago
Jaime Marquínez Ferrándiz
5bf28d7864
[utils] dfxp2srt: add additional namespace
Used by the ZDF subtitles (#9081).
8 years ago
Sergey M․
15d260ebaa
[utils] Use update_Request in http_request
8 years ago
Sergey M․
ed0291d153
[utils] Add update_Request
8 years ago
Sergey M․
17bcc626bf
[utils] Extract sanitize_url routine
8 years ago
Sergey M․
15707c7e02
[compat] Add compat_urllib_parse_urlencode and eliminate encode_dict
encode_dict functionality has been improved and moved directly into compat_urllib_parse_urlencode
All occurrences of compat_urllib_parse.urlencode throughout the codebase have been replaced by compat_urllib_parse_urlencode
Closes #8974
8 years ago
Yen Chi Hsuan
622d19160b
[utils] Clarify Python versions affected by buggy struct module
8 years ago
Yen Chi Hsuan
efbed08dc2
[utils] Encode hostnames before passing to urllib
With IDN (Internationalized Domain Name) and a proxy, non-ascii URLs
are passed down to urllib/urllib2, causing UnicodeEncodeError
Fixes #8890
8 years ago
Jaime Marquínez Ferrándiz
782b1b5bd1
[utils] lookup_unit_table: Match word boundary instead of end of string
8 years ago
Jaime Marquínez Ferrándiz
09fc33198a
utils: lookup_unit_table: Use a stricter regex
In parse_count multiple units start with the same letter, so it would match different units depending on the order they were sorted when iterating over them.
8 years ago
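The two lookup_unit_table entries above describe matching the whole unit name, followed by a word boundary, instead of a looser pattern. A minimal sketch of that idea is below; the signature, the number pattern, and the example table are assumptions for illustration and are not taken from the project.

```python
import re


def lookup_unit_table(unit_table, s):
    """Parse strings such as '1.5k' using a dict of unit name -> multiplier.

    Returns an int, or None when no number followed by a known unit is found.
    """
    # Alternation of the full unit names; the trailing \b makes sure that
    # 'k' cannot match the first letter of a longer, different unit.
    units_re = '|'.join(re.escape(unit) for unit in unit_table)
    m = re.match(
        r'(?P<num>[0-9]+(?:[,.][0-9]+)?)\s*(?P<unit>%s)\b' % units_re, s)
    if not m:
        return None
    num = float(m.group('num').replace(',', '.'))
    return int(num * unit_table[m.group('unit')])


# Hypothetical table in which several units share their first letter.
EXAMPLE_UNITS = {'k': 1000, 'kk': 1000 ** 2, 'm': 1000 ** 2}
```

Because the regex backtracks across the alternatives until the word boundary is satisfied, the result no longer depends on the order in which the units are iterated.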
Sergey M․
810c10baa1
[utils] Use compat_xpath
8 years ago
Sergey M․
c5229f3926
[utils] PEP 8
8 years ago
Sergey M․
2f7ae819ac
[utils] PEP 8
8 years ago
Sergey M․
fb47597b09
[bbc] Generalize unit table lookup and add parse_count
8 years ago
Yen Chi Hsuan
25cb05bda9
[utils] Remove codec2ext
This function was originally used for determining file extensions of DASH
formats. Now in DASH, ext is determined by mime_type. See #8766 for more
information.
8 years ago
Yen Chi Hsuan
6d210f2090
[utils] Add more codecs to codec2ext
BBC uses avc3. Here's an example (thanks to @remitamine for this example)
http://rdmedia.bbc.co.uk/dash/ondemand/bbb/2/client_manifest-common_init.mpd
See also https://trac.ffmpeg.org/ticket/5217
8 years ago
Yen Chi Hsuan
19a17d4623
[utils] Add codec2ext
8 years ago
Jaime Marquínez Ferrándiz
3233a68fbb
[utils] update_url_query: Encode the strings in the query dict
The test case with {'test': '第二行тест'} was failing on python 2 (the non-ascii characters were replaced with '?').
8 years ago
remitamine
38f9ef31dc
[utils] add update_url_query function
8 years ago
Yen Chi Hsuan
8ee239e921
[utils] Jython support - handle filenames correctly
Now test:youtube downloads
8 years ago
Brian Foley
8bb56eeeea
[utils] Add extract_attributes for extracting html tag attributes
This is much more robust than just using regexps, and handles all
the common scenarios, such as empty/no values, repeated attributes,
entity decoding, mixed case names, and the different possible value
quoting schemes.
9 years ago
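The scenarios listed in the entry above (empty or missing values, repeated attributes, entity decoding, mixed-case names, different quoting schemes) can be handled by delegating to an HTML parser instead of regular expressions. The following is a sketch under that assumption, using Python 3's standard `html.parser`; the class name `_AttributeCollector` is hypothetical and the project's actual implementation may differ.

```python
from html.parser import HTMLParser


class _AttributeCollector(HTMLParser):
    """Collects the attributes of the first start tag it sees."""

    def __init__(self):
        super().__init__()
        self.attrs = {}

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; names are lowercased and
        # values are entity-decoded by HTMLParser. For repeated attributes
        # dict() keeps the last occurrence.
        if not self.attrs:
            self.attrs = dict(attrs)


def extract_attributes(html_element):
    """Return a dict of attribute name -> value for a single HTML tag."""
    parser = _AttributeCollector()
    parser.feed(html_element)
    parser.close()
    return parser.attrs
```

An attribute written without a value is reported as `None`, while an explicitly empty value is reported as an empty string.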
remitamine
e07237f640
[utils] remove check for val from find_xpath_attr
8 years ago
Yen Chi Hsuan
5eb6bdced4
[utils] Multiple changes to base_n()
1. Renamed to encode_base_n()
2. Allow tables longer than 62 characters
3. Raise ValueError instead of AssertionError for invalid input data
4. Return the first character in the table instead of '0' for number 0
5. Add tests
8 years ago
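The five changes listed in the entry above can be illustrated with a short sketch. It follows the behaviour described in the commit message (custom tables of any length, `ValueError` on invalid input, first table character for 0), but the default table and the exact checks are assumptions, not the project's code.

```python
def encode_base_n(num, n, table=None):
    """Encode the non-negative integer num in base n.

    table supplies the digit characters; when omitted, an assumed default
    of digits, lowercase and uppercase letters (62 characters) is used.
    """
    default_table = (
        '0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ')
    if table is None:
        table = default_table[:n]
    if n > len(table):
        # Invalid input raises ValueError rather than AssertionError.
        raise ValueError('base %d exceeds table length %d' % (n, len(table)))
    if num < 0:
        raise ValueError('num must be non-negative')
    if num == 0:
        # The first character of the table, not a hard-coded '0'.
        return table[0]
    digits = ''
    while num:
        digits = table[num % n] + digits
        num //= n
    return digits
```

For example, `encode_base_n(255, 16)` yields `'ff'`, and `encode_base_n(0, 2, 'xy')` yields `'x'` because `'x'` is the zero digit of that table.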
Yen Chi Hsuan
680079be39
[utils] Relaxing regex in decode_packed_codes for vidzi
8 years ago
Yen Chi Hsuan
f52354a889
[utils] Move code for handling eval() from iqiyi.py
8 years ago
Yen Chi Hsuan
59f898b7a7
[utils] Merge base_n functions
8 years ago
Yen Chi Hsuan
481888294d
[utils] Add base36 for use in Vidzi
8 years ago
Yen Chi Hsuan
81bdc8fdf6
[utils] Move base62 to utils
8 years ago
Sergey M․
f160785c5c
[utils] Remove AM/PM from unified_strdate patterns
8 years ago
Yen Chi Hsuan
b95dc034ca
[utils] Implement cache for OnDemandPagedList
8 years ago
remitamine
cafcf657a4
add more subtitles mime types to mimetype2ext and fix the platform subtitle extraction
8 years ago
Yen Chi Hsuan
c1c05c67ea
[utils] Jython support - disable setproctitle() until ctypes is complete
8 years ago
Yen Chi Hsuan
399a76e67b
[utils] Jython support: tolerate missing fcntl module
8 years ago
Jaime Marquínez Ferrándiz
765ac263db
[utils] mimetype2ext: return 'm4a' for 'audio/mp4' (fixes #8620)
The youtube extractor was using 'mp4' for them, therefore filters like 'bestaudio[ext=m4a]' stopped working (94278f7202 broke it).
8 years ago
Yen Chi Hsuan
5bc880b988
[utils] Add OHDave's RSA encryption function
8 years ago
Sergey M․
611c1dd96e
[refactor] Single quotes consistency
8 years ago
Sergey M․
d800609c62
[refactor] Do not specify redundant None as second argument in dict.get()
8 years ago
Sergey M․
9c7b38981c
[utils] Bump Firefox version in User-Agent
Old version number causes Youtube not to serve some formats in ytplayer.config
8 years ago
Sergey M․
8411229bd5
[utils] Allow dot in strip_jsonp
8 years ago
Sergey M․
86296ad2cd
[utils] Add ability to control skipping false values in dict_get
8 years ago
Sergey M․
cbecc9b903
[utils] Add dict_get convenience method
8 years ago
Jaime Marquínez Ferrándiz
87de7069b9
[utils] dfxp2srt: make TTMLPElementParser inherit from object
For consistency between python 2 and 3.
8 years ago
remitamine
2b14cb566f
[utils] fix dfxp2srt text extraction (fixes #8055)
8 years ago
Yen Chi Hsuan
a0d8d704df
[utils] Reorder items in mimetype2ext alphabetically
8 years ago
Yen Chi Hsuan
f6861ec96f
[utils] Add more items to mimetype2ext (#8293)
These are used in Youtube formats
8 years ago
remitamine
6ec6cb4e95
Revert "fix typos"
This reverts commit 36a0e46c39.
9 years ago