downloading folders from google drive.

I wanted to download some course material on RL shared by the author via Google drive using the command line.  I got a bunch of stuff using wget a folder in google drive was a challenge. I looked it up in SO which gave me a hint but no solution. I installed gdown using pip and then used: gdown --folder --continue https://drive.google.com/drive/folders/1V9jAShWpccLvByv5S1DuOzo6GVvzd4LV if there are more than 50 files you need to use --remaining-ok and only get the first 50. In such a case its best to download using the folder using the UI and decompress locally. Decompressing from the command line created errors related to unicode but using the mac UI I decompressed without a glitch.

Data Science Glossary - S

Sampling
is the of statistical technique of selecting a minimal subset of data that is representative of the full sample and allows reporting on the trends based on just the sample set. Sampling is widely used in statistical analysis because analyzing a samples provides faster and cheaper alternative to working with the full data sets. It is both a blessing when used correctly an when applied arbitrarily to all data. GA uses the latter approach which renders many reports useless for tracking smaller segment.
Screaming Frog (SEO Spider)
is a web crawler capable of automating the checking a web site for the presence of snippets such as the UA code snippet. The software is available here.
Segment
a segment is a way to slice your data, google analytics comes with a number of segments and also allows you to use custom segments.
SDK
is an acronym for software development kit. For example to implement analytics on mobile devices an SDK is required.
SEO
is an acronym for search engine optimization.
Session
since the web is inherently stateless, the session is managed by a web server with the rule being that there is a maximal timeout usually of 30 minutes. The collection of  sequence of  page-views on a single server.
Self selected sample
is when the inclusion or exclusion of sampling units is determined by whether the participant themselves agree or decline to participate in the sample, either explicitly or implicitly.
Self selection bias
self selection protocols tend to introduce a bias.
Site map
an XML file the is used by crawlers to locate all the pages in a web site or application.
SMO
Social media optimization. The misguided attempt at getting higher rank and broader reach for activities in a social network.
Social network analysis
an interdisciplinary methodology - combining social science and using graph theory to understand the social structure of your users as a collective.

Comments

Popular posts from this blog

Moodle <=< Mediawiki SUL integration - first thoughts

downloading folders from google drive.

Insight into progressive web apps