hydrus/help/faq.html

55 lines
7.6 KiB
HTML
Raw Normal View History

2013-03-27 20:02:51 +00:00
<html>
<head>
<title>faq</title>
<link href="hydrus.ico" rel="shortcut icon" />
<link href="style.css" rel="stylesheet" type="text/css" />
</head>
<body>
<div class="content">
2015-08-12 20:35:24 +00:00
<a name="repositories"><h3>what is a repository?</h3></a>
<p>A <i>repository</i> is a service in the hydrus network that stores a certain kind of information<6F>files or tag mappings, for instance<63>as submitted by users all over the internet. Those users periodically synchronise with the repository so they know everything that it stores. Sometimes, like with tags, this means creating a complete local copy of everything on the repository. Hydrus network clients never send queries to repositories; they perform queries over their local cache of the repository's data, keeping everything confined to the same computer.</p>
<a name="tags"><h3>what is a tag?</h3></a>
2015-10-21 21:53:10 +00:00
<p><a href="https://en.wikipedia.org/wiki/Tag_(metadata)">wiki</a></p>
2013-12-11 22:09:25 +00:00
<p>A <i>tag</i> is a small bit of text describing a single property of something. They make searching easy. Good examples are "flower" or "nicolas cage" or "the sopranos" or "2003". By combining several tags together ( e.g. [ 'tiger woods', 'sports illustrated', '2008' ] or [ 'cosplay', 'the legend of zelda' ] ), a huge image collection is reduced to a tiny and easy-to-digest sample.</p>
2013-03-27 20:02:51 +00:00
<p>A good word for the connection of a particular tag to a particular file is <i>mapping</i>.</p>
<p>In the hydrus network, all tags are automatically converted to lower case. 'Sunset Drive' becomes 'sunset drive'. Why?</p>
<ol>
2015-08-12 20:35:24 +00:00
<li>Although it may seem preferable to have 'The Lord of the Rings' rather than 'the lord of the rings', there are many, many special cases where style guides differ on which words to capitalise.</li>
<li>Searches become far easier when case is not matched. And When case does not matter, what point is there in recording it?</li>
2013-03-27 20:02:51 +00:00
</ol>
<p>Secondly, leading and trailing whitespace is removed, and multiple whitespace is collapsed to a single character. <pre>' yellow dress '</pre> becomes <pre>'yellow dress'</pre></p>
2015-08-12 20:35:24 +00:00
<a name="namespaces"><h3>what is a namespace?</h3></a>
<p>A <i>namespace</i> is a category that in hydrus prefixes a tag. An example is 'person' in the tag 'person:ron paul'--it lets people and software know that 'ron paul' is a name. You can create any namespace you like; just type one or more words and then a colon, and then the next string of text will have that namespace.</p>
<p>The hydrus client gives namespaces different colours so you can pick out important tags more easily in a large list, and you can also search by a particular namespace, even creating complicated predicates like 'give all files that do not have any character tags', for instance.</p>
<a name="filenames"><h3>why not use existing filenames and folders?</h3></a>
2013-12-11 22:09:25 +00:00
<p>As a retrieval method, filenames and folders are less and less useful as the number of files increases. Why?</p>
2013-03-27 20:02:51 +00:00
<ul>
2013-12-11 22:09:25 +00:00
<li>A filename is not unique; did you mean this "04.jpg" or <i>this</i> "04.jpg" in another folder? Perhaps "04 (3).jpg"?</li>
<li>A filename is not guaranteed to describe the file correctly, e.g. hello.jpg</li>
2015-08-12 20:35:24 +00:00
<li>A filename is not guaranteed to stay the same, meaning other programs cannot rely on the filename address being valid or even returning the same data every time.</li>
<li>A filename is often<65>for <i>ridiculous</i> reasons<6E>limited to a certain prohibitive character set. Even when utf-8 is supported, some arbitrary ascii characters are usually not, and different localisations, operating systems and formatting conventions only make it worse.</p>
2013-03-27 20:02:51 +00:00
<li>Folders can offer context, but they are clunky and time-consuming to change. If you put each chapter of a comic in a different folder, for instance, reading several volumes in one sitting can be a pain. Nesting many folders adds navigation-latency and tends to induce less informative "04.jpg"-type filenames.</li>
</ul>
<p>So, the client tracks files by their <i>hash</i>.</p>
2015-10-21 21:53:10 +00:00
<p>Please do not tag your files with their exact original 'filename.jpg' on my public tag repo. <a href="https://www.youtube.com/watch?v=_yYS0ZZdsnA">Shed the concept of filenames as you would chains.</a></p>
2015-08-12 20:35:24 +00:00
<a name="hashes"><h3>what is a hash?</h3></a>
2015-10-21 21:53:10 +00:00
<p><a href="https://en.wikipedia.org/wiki/Hash_function">wiki</a></p>
2015-08-12 20:35:24 +00:00
<p>Hashes are a subject you usually has to be a software engineer to find interesting. The simple answer is that they are unique names for things. It can be proven that f099b5823f4e36a4bd6562812582f60e49e818cf445902b504b5533c6a5dad94 refers to one particular file and no other. Hashes make excellent identifiers inside software. In the client's normal operation, you will never encounter a file's hash. If you want to see a thumbnail bigger, double-click it; the software handles the mathematics.</p>
2013-12-11 22:09:25 +00:00
<p><i>For those who </i>are<i> interested: hydrus uses SHA-256, which spits out 32-byte (256-bit) hashes. The software stores the hash densely, as 32 bytes, only encoding it to 64 hex characters when the user views it or copies to clipboard. SHA-256 is not perfect, but it is a great compromise candidate; it is secure for now, it is reasonably fast, it is available for most programming languages, and newer CPUs perform it more efficiently all the time.</i></p>
2015-08-12 20:35:24 +00:00
<a name="access_keys"><h3>what is an access key?</h3></a>
2014-10-01 22:58:32 +00:00
<p>The hydrus network's repositories do not use username/password, but instead a single strong identifier-password like this:</p>
2013-12-11 22:09:25 +00:00
<p><i>7ce4dbf18f7af8b420ee942bae42030aab344e91dc0e839260fcd71a4c9879e3</i></p>
<p>These hex numbers give you access to a particular account on a particular repository, and are often combined like so:</p>
<p><i>7ce4dbf18f7af8b420ee942bae42030aab344e91dc0e839260fcd71a4c9879e3@hostname.com:45871</i></p>
<p>They are long enough to be impossible to guess, and also randomly generated, so they reveal nothing personally identifying about you. Many people can use the same access key (and hence the same account) on a repository without consequence, although they will have to share bandwidth limits, and if one person screws around and gets the account banned, everyone will lose access.</p>
2014-10-01 22:58:32 +00:00
<p>The access key is the account. Do not give it to anyone you do not want to have access to the account. An administrator will never need it; instead they will want your <i>account key</i>.</p>
2015-08-12 20:35:24 +00:00
<a name="account_keys"><h3>what is an account key?</h3></a>
<p>This is another long string of random hexadecimal that <i>identifies</i> your account without giving away access. If you need to identify yourself to a repository administrator (say, to get your account's permissions modified), you will need to tell them your account key. You can copy it to your clipboard in <i>services->review services</i>.</p>
2013-03-27 20:02:51 +00:00
<h3>why aren't my swfs showing?</h3>
<p>If an Internet Explorer "Navigation Cancelled" page appears whenever you click on a swf thumbnail, try installing Flash Player for Internet Explorer. Just having it installed for Firefox/Opera is not enough; you need the ActiveX component that comes with the specific IE version. Just boot IE and download/run the installer from Adobe's site.</p>
<a name="delays"><h3>why can my friend not see what I just uploaded?</h3></a>
<p>The repositories do not work like conventional search engines; it takes a short but predictable while for changes to propagate to other users.</p>
2015-08-12 20:35:24 +00:00
<p>The client's searches only ever happen over its local cache of what is on the repository. Any changes you make will be delayed for others until their next update occurs. At the moment, the update period is 100,000 seconds, which is about 1 day and 4 hours.</p>
2013-03-27 20:02:51 +00:00
</div>
</body>
2013-02-19 00:11:43 +00:00
</html>