Commit Graph

3023 Commits (b860967ce46358f6f54d2374af7e880155339392)
 

Author SHA1 Message Date
Marcin Cieślak 5137ebac0b [tvp] Telewizja Polska: new extractor for tvp.pl, fixes #1719
Thanks-To: mplonski

https://github.com/mplonski/linux/blob/master/tvp-dl.py
11 years ago
Marcin Cieślak a8eeb0597b Fix AssertionError when og property not found
On tvp.pl some webpages contain OpenGraph
metadata and some don't.

If og property is not found, _og_search_description
fails with

WARNING: unable to extract OpenGraph description; please report this issue on http://yt-dl.org/bug
Traceback (most recent call last):
  File "/usr/home/saper/bin/youtube-dl", line 18, in <module>
    youtube_dl.main()
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/__init__.py", line 766, in main
    _real_main(argv)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/__init__.py", line 719, in _real_main
    retcode = ydl.download(all_urls)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/YoutubeDL.py", line 715, in download
    videos = self.extract_info(url)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/YoutubeDL.py", line 348, in extract_info
    ie_result = ie.extract(url)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/common.py", line 125, in extract
    return self._real_extract(url)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/tvp.py", line 56, in _real_extract
    info['description'] = self._og_search_description(webpage)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/common.py", line 331, in _og_search_description
    return self._og_search_property('description', html, fatal=False, **kargs)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/common.py", line 325, in _og_search_property
    return unescapeHTML(escaped)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/utils.py", line 494, in unescapeHTML
    assert type(s) == type(u'')
AssertionError

The patch allows me to use:

  try:
    info['description'] = self._og_search_description(webpage)
    info['thumbnail'] = self._og_search_thumbnail(webpage)
  except RegexNotFoundError:
    pass
11 years ago
Ismaël Mejía 4ed3e51080 [ted] fixed error in case of no subtitles present
I created a test, but I leave it commented since TED videos get
new subtitles frequently.
11 years ago
Jaime Marquínez Ferrándiz 7f34001d57 Merge pull request #1724 from rzhxeo/generic_youtube
[GenericIE] Also detect youtube if src url of iframe is embedded in ' instead of "
11 years ago
rzhxeo 2dcf7d8f99 [GenericIE] Also detect youtube if src url of iframe is embedded in ' instaed of " 11 years ago
Jaime Marquínez Ferrándiz 19b0668251 [canal2c] Accept more urls (fixes #1723)
The url only needs to have the 'idVideo' field in the query, in any position.
We have to set the 'void=oui' in the webpage url, so that we get the file name.
11 years ago
Jaime Marquínez Ferrándiz e7e6b54d8a [teamcoco] Parse the xml file and extract all the formats 11 years ago
Jaime Marquínez Ferrándiz 2a1a8ffe41 Merge pull request #1693 from alexvh/teamcoco_fix
[teamcoco] Fix video url extraction for some videos
11 years ago
Philipp Hagemeister 08fb86c49b [youtube] Add description for YoutubeSearchDateIE (#1710) 11 years ago
Philipp Hagemeister 3633d77c0f Merge remote-tracking branch 'CBGoodBuddy/ytsearchtime' 11 years ago
Philipp Hagemeister 165e179764 release 2013.11.03 11 years ago
Philipp Hagemeister 12ebdd1506 [viddler] Support non-digit IDs (Fixes #1714) 11 years ago
Jaime Marquínez Ferrándiz 1baf9a5938 Merge pull request #1698 from rzhxeo/cinemassacre
[CinemassacreIE] Support more embed urls
11 years ago
Jaime Marquínez Ferrándiz a56f9de156 Style fixes for extractors: remove spaces around (,),{ and } 11 years ago
Jaime Marquínez Ferrándiz fa5d47af4b Merge pull request #1679 from rzhxeo/mofosex
Add support for http://www.mofosex.com
11 years ago
Jaime Marquínez Ferrándiz d607038753 Merge pull request #1677 from rzhxeo/xtube
Add support for http://www.xtube.com
11 years ago
Jaime Marquínez Ferrándiz 9ac6a01aaf Merge pull request #1676 from rzhxeo/extremetube
Add support for http://www.extremetube.com
11 years ago
Jaime Marquínez Ferrándiz be97abc247 Set the 'extractor_key' field in the info_dict
It's the string returned by the class method 'ie_key', which allows to retrieve the extractor with 'get_info_extractor'
11 years ago
Jaime Marquínez Ferrándiz 9103bbc5cd Add the 'webpage_url' field to info_dict
The url for the video page, it must allow to reproduce the result.
It's automatically set by YoutubeDL if it's missing.
11 years ago
Jaime Marquínez Ferrándiz b6c45014ae Set the extra_info inside YoutubeDL.process_ie_result and set only if the keys are missing 11 years ago
Craig Markwardt a3dd924871 Add YoutubeSearchDateIE extractor to youtube.py & __init__.py, which searches by publication date. 11 years ago
rzhxeo 137bbb3e37 [XTubeIE] Add description to TEST 11 years ago
rzhxeo 86ad94bb2e [ExtremeTubeIE] Set age_limit to 18 and fix uploader extraction 11 years ago
Jaime Marquínez Ferrándiz 3e56add7c9 Merge pull request #1678 from rzhxeo/keezmovies
[KeezMoviesIE] Detect URLs with numbers in the SEO part correct
11 years ago
Jaime Marquínez Ferrándiz f52f01b5d2 [brightcove] Don't set the extension
If the video only has the 'FLVFullLengthURL' key, it can still be an mp4 file.
11 years ago
Jaime Marquínez Ferrándiz 98d7efb537 [exfm] skip tests
The site is down too often.
11 years ago
Jaime Marquínez Ferrándiz cf51923545 [youtube] Remove vevo test
The video is no longer available and it seems that vevo video don't use encrypted signatures anymore.
11 years ago
Ismaël Mejía 38fcd4597a Merge remote-tracking branch 'iemejia/master' 11 years ago
Jaime Marquínez Ferrándiz 165e3bb67a [bambuser] Add an extractor for channels (closes #1702) 11 years ago
Ismaël Mejía 38db46794f Merge branch 'ted_subtitles' 11 years ago
Ismaël Mejía a9a3876d55 [ted] Added support for subtitle download 11 years ago
Ismaël Mejía 1f343eaabb [subtitles] refactor to support websites with subtitle information the
webpage.

I added the parameter webpage, so now it's similar to the way automatic
captions are handled. This is an improvement needed for websites like
TED.
11 years ago
Jaime Marquínez Ferrándiz 72a5b4f702 Add an extractor for bambuser.com (#1702) 11 years ago
rzhxeo 0a43ddf320 [CinemassacreIE] Add live paramter to extracted info as a workaround 11 years ago
rzhxeo 31366066bd Add support for live parameter to rtmpdump 11 years ago
Philipp Hagemeister aa2484e390 release 2013.11.02 11 years ago
Philipp Hagemeister 8eddf3e91d [youtube] Encode subtitle track name in request (Fixes #1700) 11 years ago
Jaime Marquínez Ferrándiz 60d142aa8d Add an extractor for vk.com (closes #1635) 11 years ago
Jaime Marquínez Ferrándiz 66cf3ac342 [metacafe] Fix support for age-restricted videos (fixes #1696)
The 'Content-Type' header must be set for disabling the family filter.
The 'flashversion' cookie  is only needed for AnyClip videos.
Added tests for standard metacafe videos and for age-restricted videos.
Also set the 'age_limit' field.
11 years ago
rzhxeo ab4e151347 [CinemassacreIE] Support more embed urls 11 years ago
Alex Van't Hof ac2547f5ff [teamcoco] Fix video url extraction for some videos
Video url extraction failed for some videos,
e.g. http://teamcoco.com/video/old-time-baseball

The url extracted was also occasionally suboptimal quality,
e.g. http://teamcoco.com/video/louis-ck-interview-george-w-bush
11 years ago
Jaime Marquínez Ferrándiz 5f1ea943ab [livestream] fix the extraction of events
It now uses a json dictionary from the webpage.
11 years ago
Jaime Marquínez Ferrándiz 0ef7ad5cd4 Fix the test for dailymotion subtitles
The extractor returns a single info_dict now.
11 years ago
Philipp Hagemeister 9f1109a564 [dailymotion] Fix support for age-restricted videos (Fixes #1688) 11 years ago
Philipp Hagemeister 33b1d9595d release 2013.10.30 11 years ago
Philipp Hagemeister 7193498811 Use index in formt string (Fixes vevo test on Python 2.6) 11 years ago
Philipp Hagemeister 72321ead7b [vevo] Readd support for SMIL (Fixes #1683) 11 years ago
Philipp Hagemeister b5d0d817bc Remove superfluous space 11 years ago
Philipp Hagemeister 94badb2599 Fix output indenting for --list-formats 11 years ago
Filippo Valsorda b9a836515f Update the Vimeo test vector md5
confirmed that this is indeed the first 10241 (we went off by one with
byte range 0-10240) of the full, playing mp4, so they probably
reencoded or something
11 years ago