Mercurial > hgbook
comparison en/ch05-collab.xml @ 682:28b5a5befb08
Fold preface and intro into one
author | Bryan O'Sullivan <bos@serpentine.com> |
---|---|
date | Thu, 19 Mar 2009 20:54:12 -0700 |
parents | en/ch06-collab.xml@8366882f67f2 |
children | c838b3975bc6 |
comparison
equal
deleted
inserted
replaced
681:5bfa0df6aaed | 682:28b5a5befb08 |
---|---|
1 <!-- vim: set filetype=docbkxml shiftwidth=2 autoindent expandtab tw=77 : --> | |
2 | |
3 <chapter id="cha:collab"> | |
4 <?dbhtml filename="collaborating-with-other-people.html"?> | |
5 <title>Collaborating with other people</title> | |
6 | |
7 <para>As a completely decentralised tool, Mercurial doesn't impose | |
8 any policy on how people ought to work with each other. However, | |
9 if you're new to distributed revision control, it helps to have | |
10 some tools and examples in mind when you're thinking about | |
11 possible workflow models.</para> | |
12 | |
13 <sect1> | |
14 <title>Mercurial's web interface</title> | |
15 | |
16 <para>Mercurial has a powerful web interface that provides several | |
17 useful capabilities.</para> | |
18 | |
19 <para>For interactive use, the web interface lets you browse a | |
20 single repository or a collection of repositories. You can view | |
21 the history of a repository, examine each change (comments and | |
22 diffs), and view the contents of each directory and file.</para> | |
23 | |
24 <para>Also for human consumption, the web interface provides an | |
25 RSS feed of the changes in a repository. This lets you | |
26 <quote>subscribe</quote> to a repository using your favourite | |
27 feed reader, and be automatically notified of activity in that | |
28 repository as soon as it happens. I find this capability much | |
29 more convenient than the model of subscribing to a mailing list | |
30 to which notifications are sent, as it requires no additional | |
31 configuration on the part of whoever is serving the | |
32 repository.</para> | |
33 | |
34 <para>The web interface also lets remote users clone a repository, | |
35 pull changes from it, and (when the server is configured to | |
36 permit it) push changes back to it. Mercurial's HTTP tunneling | |
37 protocol aggressively compresses data, so that it works | |
38 efficiently even over low-bandwidth network connections.</para> | |
39 | |
40 <para>The easiest way to get started with the web interface is to | |
41 use your web browser to visit an existing repository, such as | |
42 the master Mercurial repository at <ulink | |
43 url="http://www.selenic.com/repo/hg?style=gitweb">http://www.selenic.com/repo/hg?style=gitweb</ulink>.</para> | |
44 | |
45 <para>If you're interested in providing a web interface to your | |
46 own repositories, Mercurial provides two ways to do this. The | |
47 first is using the <command role="hg-cmd">hg serve</command> | |
48 command, which is best suited to short-term | |
49 <quote>lightweight</quote> serving. See section <xref | |
50 linkend="sec:collab:serve"/> below for details of how to use | |
51 this command. If you have a long-lived repository that you'd | |
52 like to make permanently available, Mercurial has built-in | |
53 support for the CGI (Common Gateway Interface) standard, which | |
54 all common web servers support. See section <xref | |
55 linkend="sec:collab:cgi"/> for details of CGI | |
56 configuration.</para> | |
57 | |
58 </sect1> | |
59 <sect1> | |
60 <title>Collaboration models</title> | |
61 | |
62 <para>With a suitably flexible tool, making decisions about | |
63 workflow is much more of a social engineering challenge than a | |
64 technical one. Mercurial imposes few limitations on how you can | |
65 structure the flow of work in a project, so it's up to you and | |
66 your group to set up and live with a model that matches your own | |
67 particular needs.</para> | |
68 | |
69 <sect2> | |
70 <title>Factors to keep in mind</title> | |
71 | |
72 <para>The most important aspect of any model that you must keep | |
73 in mind is how well it matches the needs and capabilities of | |
74 the people who will be using it. This might seem | |
75 self-evident; even so, you still can't afford to forget it for | |
76 a moment.</para> | |
77 | |
78 <para>I once put together a workflow model that seemed to make | |
79 perfect sense to me, but that caused a considerable amount of | |
80 consternation and strife within my development team. In spite | |
81 of my attempts to explain why we needed a complex set of | |
82 branches, and how changes ought to flow between them, a few | |
83 team members revolted. Even though they were smart people, | |
84 they didn't want to pay attention to the constraints we were | |
85 operating under, or face the consequences of those constraints | |
86 in the details of the model that I was advocating.</para> | |
87 | |
88 <para>Don't sweep foreseeable social or technical problems under | |
89 the rug. Whatever scheme you put into effect, you should plan | |
90 for mistakes and problem scenarios. Consider adding automated | |
91 machinery to prevent, or quickly recover from, trouble that | |
92 you can anticipate. As an example, if you intend to have a | |
93 branch with not-for-release changes in it, you'd do well to | |
94 think early about the possibility that someone might | |
95 accidentally merge those changes into a release branch. You | |
96 could avoid this particular problem by writing a hook that | |
97 prevents changes from being merged from an inappropriate | |
98 branch.</para> | |
99 | |
100 </sect2> | |
101 <sect2> | |
102 <title>Informal anarchy</title> | |
103 | |
104 <para>I wouldn't suggest an <quote>anything goes</quote> | |
105 approach as something sustainable, but it's a model that's | |
106 easy to grasp, and it works perfectly well in a few unusual | |
107 situations.</para> | |
108 | |
109 <para>As one example, many projects have a loose-knit group of | |
110 collaborators who rarely physically meet each other. Some | |
111 groups like to overcome the isolation of working at a distance | |
112 by organising occasional <quote>sprints</quote>. In a sprint, | |
113 a number of people get together in a single location (a | |
114 company's conference room, a hotel meeting room, that kind of | |
115 place) and spend several days more or less locked in there, | |
116 hacking intensely on a handful of projects.</para> | |
117 | |
118 <para>A sprint is the perfect place to use the <command | |
119 role="hg-cmd">hg serve</command> command, since <command | |
120 role="hg-cmd">hg serve</command> does not require any fancy | |
121 server infrastructure. You can get started with <command | |
122 role="hg-cmd">hg serve</command> in moments, by reading | |
123 section <xref linkend="sec:collab:serve"/> below. Then simply | |
124 tell | |
125 the person next to you that you're running a server, send the | |
126 URL to them in an instant message, and you immediately have a | |
127 quick-turnaround way to work together. They can type your URL | |
128 into their web browser and quickly review your changes; or | |
129 they can pull a bugfix from you and verify it; or they can | |
130 clone a branch containing a new feature and try it out.</para> | |
131 | |
132 <para>The charm, and the problem, with doing things in an ad hoc | |
133 fashion like this is that only people who know about your | |
134 changes, and where they are, can see them. Such an informal | |
135 approach simply doesn't scale beyond a handful people, because | |
136 each individual needs to know about $n$ different repositories | |
137 to pull from.</para> | |
138 | |
139 </sect2> | |
140 <sect2> | |
141 <title>A single central repository</title> | |
142 | |
143 <para>For smaller projects migrating from a centralised revision | |
144 control tool, perhaps the easiest way to get started is to | |
145 have changes flow through a single shared central repository. | |
146 This is also the most common <quote>building block</quote> for | |
147 more ambitious workflow schemes.</para> | |
148 | |
149 <para>Contributors start by cloning a copy of this repository. | |
150 They can pull changes from it whenever they need to, and some | |
151 (perhaps all) developers have permission to push a change back | |
152 when they're ready for other people to see it.</para> | |
153 | |
154 <para>Under this model, it can still often make sense for people | |
155 to pull changes directly from each other, without going | |
156 through the central repository. Consider a case in which I | |
157 have a tentative bug fix, but I am worried that if I were to | |
158 publish it to the central repository, it might subsequently | |
159 break everyone else's trees as they pull it. To reduce the | |
160 potential for damage, I can ask you to clone my repository | |
161 into a temporary repository of your own and test it. This | |
162 lets us put off publishing the potentially unsafe change until | |
163 it has had a little testing.</para> | |
164 | |
165 <para>In this kind of scenario, people usually use the | |
166 <command>ssh</command> protocol to securely push changes to | |
167 the central repository, as documented in section <xref | |
168 linkend="sec:collab:ssh"/>. It's also | |
169 usual to publish a read-only copy of the repository over HTTP | |
170 using CGI, as in section <xref linkend="sec:collab:cgi"/>. | |
171 Publishing over HTTP | |
172 satisfies the needs of people who don't have push access, and | |
173 those who want to use web browsers to browse the repository's | |
174 history.</para> | |
175 | |
176 </sect2> | |
177 <sect2> | |
178 <title>Working with multiple branches</title> | |
179 | |
180 <para>Projects of any significant size naturally tend to make | |
181 progress on several fronts simultaneously. In the case of | |
182 software, it's common for a project to go through periodic | |
183 official releases. A release might then go into | |
184 <quote>maintenance mode</quote> for a while after its first | |
185 publication; maintenance releases tend to contain only bug | |
186 fixes, not new features. In parallel with these maintenance | |
187 releases, one or more future releases may be under | |
188 development. People normally use the word | |
189 <quote>branch</quote> to refer to one of these many slightly | |
190 different directions in which development is | |
191 proceeding.</para> | |
192 | |
193 <para>Mercurial is particularly well suited to managing a number | |
194 of simultaneous, but not identical, branches. Each | |
195 <quote>development direction</quote> can live in its own | |
196 central repository, and you can merge changes from one to | |
197 another as the need arises. Because repositories are | |
198 independent of each other, unstable changes in a development | |
199 branch will never affect a stable branch unless someone | |
200 explicitly merges those changes in.</para> | |
201 | |
202 <para>Here's an example of how this can work in practice. Let's | |
203 say you have one <quote>main branch</quote> on a central | |
204 server.</para> | |
205 | |
206 &interaction.branching.init; | |
207 | |
208 <para>People clone it, make changes locally, test them, and push | |
209 them back.</para> | |
210 | |
211 <para>Once the main branch reaches a release milestone, you can | |
212 use the <command role="hg-cmd">hg tag</command> command to | |
213 give a permanent name to the milestone revision.</para> | |
214 | |
215 &interaction.branching.tag; | |
216 | |
217 <para>Let's say some ongoing | |
218 development occurs on the main branch.</para> | |
219 | |
220 &interaction.branching.main; | |
221 | |
222 <para>Using the tag that was recorded at the milestone, people | |
223 who clone that repository at any time in the future can use | |
224 <command role="hg-cmd">hg update</command> to get a copy of | |
225 the working directory exactly as it was when that tagged | |
226 revision was committed.</para> | |
227 | |
228 &interaction.branching.update; | |
229 | |
230 <para>In addition, immediately after the main branch is tagged, | |
231 someone can then clone the main branch on the server to a new | |
232 <quote>stable</quote> branch, also on the server.</para> | |
233 | |
234 &interaction.branching.clone; | |
235 | |
236 <para>Someone who needs to make a change to the stable branch | |
237 can then clone <emphasis>that</emphasis> repository, make | |
238 their changes, commit, and push their changes back there.</para> | |
239 | |
240 &interaction.branching.stable; | |
241 | |
242 <para>Because Mercurial repositories are independent, and | |
243 Mercurial doesn't move changes around automatically, the | |
244 stable and main branches are <emphasis>isolated</emphasis> | |
245 from each other. The changes that you made on the main branch | |
246 don't <quote>leak</quote> to the stable branch, and vice | |
247 versa.</para> | |
248 | |
249 <para>You'll often want all of your bugfixes on the stable | |
250 branch to show up on the main branch, too. Rather than | |
251 rewrite a bugfix on the main branch, you can simply pull and | |
252 merge changes from the stable to the main branch, and | |
253 Mercurial will bring those bugfixes in for you.</para> | |
254 | |
255 &interaction.branching.merge; | |
256 | |
257 <para>The main branch will still contain changes that are not on | |
258 the stable branch, but it will also contain all of the | |
259 bugfixes from the stable branch. The stable branch remains | |
260 unaffected by these changes.</para> | |
261 | |
262 </sect2> | |
263 <sect2> | |
264 <title>Feature branches</title> | |
265 | |
266 <para>For larger projects, an effective way to manage change is | |
267 to break up a team into smaller groups. Each group has a | |
268 shared branch of its own, cloned from a single | |
269 <quote>master</quote> branch used by the entire project. | |
270 People working on an individual branch are typically quite | |
271 isolated from developments on other branches.</para> | |
272 | |
273 <informalfigure id="fig:collab:feature-branches"> | |
274 <mediaobject><imageobject><imagedata | |
275 fileref="feature-branches"/></imageobject><textobject><phrase>XXX | |
276 add text</phrase></textobject><caption><para>Feature | |
277 branches</para></caption></mediaobject> | |
278 </informalfigure> | |
279 | |
280 <para>When a particular feature is deemed to be in suitable | |
281 shape, someone on that feature team pulls and merges from the | |
282 master branch into the feature branch, then pushes back up to | |
283 the master branch.</para> | |
284 | |
285 </sect2> | |
286 <sect2> | |
287 <title>The release train</title> | |
288 | |
289 <para>Some projects are organised on a <quote>train</quote> | |
290 basis: a release is scheduled to happen every few months, and | |
291 whatever features are ready when the <quote>train</quote> is | |
292 ready to leave are allowed in.</para> | |
293 | |
294 <para>This model resembles working with feature branches. The | |
295 difference is that when a feature branch misses a train, | |
296 someone on the feature team pulls and merges the changes that | |
297 went out on that train release into the feature branch, and | |
298 the team continues its work on top of that release so that | |
299 their feature can make the next release.</para> | |
300 | |
301 </sect2> | |
302 <sect2> | |
303 <title>The Linux kernel model</title> | |
304 | |
305 <para>The development of the Linux kernel has a shallow | |
306 hierarchical structure, surrounded by a cloud of apparent | |
307 chaos. Because most Linux developers use | |
308 <command>git</command>, a distributed revision control tool | |
309 with capabilities similar to Mercurial, it's useful to | |
310 describe the way work flows in that environment; if you like | |
311 the ideas, the approach translates well across tools.</para> | |
312 | |
313 <para>At the center of the community sits Linus Torvalds, the | |
314 creator of Linux. He publishes a single source repository | |
315 that is considered the <quote>authoritative</quote> current | |
316 tree by the entire developer community. Anyone can clone | |
317 Linus's tree, but he is very choosy about whose trees he pulls | |
318 from.</para> | |
319 | |
320 <para>Linus has a number of <quote>trusted lieutenants</quote>. | |
321 As a general rule, he pulls whatever changes they publish, in | |
322 most cases without even reviewing those changes. Some of | |
323 those lieutenants are generally agreed to be | |
324 <quote>maintainers</quote>, responsible for specific | |
325 subsystems within the kernel. If a random kernel hacker wants | |
326 to make a change to a subsystem that they want to end up in | |
327 Linus's tree, they must find out who the subsystem's | |
328 maintainer is, and ask that maintainer to take their change. | |
329 If the maintainer reviews their changes and agrees to take | |
330 them, they'll pass them along to Linus in due course.</para> | |
331 | |
332 <para>Individual lieutenants have their own approaches to | |
333 reviewing, accepting, and publishing changes; and for deciding | |
334 when to feed them to Linus. In addition, there are several | |
335 well known branches that people use for different purposes. | |
336 For example, a few people maintain <quote>stable</quote> | |
337 repositories of older versions of the kernel, to which they | |
338 apply critical fixes as needed. Some maintainers publish | |
339 multiple trees: one for experimental changes; one for changes | |
340 that they are about to feed upstream; and so on. Others just | |
341 publish a single tree.</para> | |
342 | |
343 <para>This model has two notable features. The first is that | |
344 it's <quote>pull only</quote>. You have to ask, convince, or | |
345 beg another developer to take a change from you, because there | |
346 are almost no trees to which more than one person can push, | |
347 and there's no way to push changes into a tree that someone | |
348 else controls.</para> | |
349 | |
350 <para>The second is that it's based on reputation and acclaim. | |
351 If you're an unknown, Linus will probably ignore changes from | |
352 you without even responding. But a subsystem maintainer will | |
353 probably review them, and will likely take them if they pass | |
354 their criteria for suitability. The more <quote>good</quote> | |
355 changes you contribute to a maintainer, the more likely they | |
356 are to trust your judgment and accept your changes. If you're | |
357 well-known and maintain a long-lived branch for something | |
358 Linus hasn't yet accepted, people with similar interests may | |
359 pull your changes regularly to keep up with your work.</para> | |
360 | |
361 <para>Reputation and acclaim don't necessarily cross subsystem | |
362 or <quote>people</quote> boundaries. If you're a respected | |
363 but specialised storage hacker, and you try to fix a | |
364 networking bug, that change will receive a level of scrutiny | |
365 from a network maintainer comparable to a change from a | |
366 complete stranger.</para> | |
367 | |
368 <para>To people who come from more orderly project backgrounds, | |
369 the comparatively chaotic Linux kernel development process | |
370 often seems completely insane. It's subject to the whims of | |
371 individuals; people make sweeping changes whenever they deem | |
372 it appropriate; and the pace of development is astounding. | |
373 And yet Linux is a highly successful, well-regarded piece of | |
374 software.</para> | |
375 | |
376 </sect2> | |
377 <sect2> | |
378 <title>Pull-only versus shared-push collaboration</title> | |
379 | |
380 <para>A perpetual source of heat in the open source community is | |
381 whether a development model in which people only ever pull | |
382 changes from others is <quote>better than</quote> one in which | |
383 multiple people can push changes to a shared | |
384 repository.</para> | |
385 | |
386 <para>Typically, the backers of the shared-push model use tools | |
387 that actively enforce this approach. If you're using a | |
388 centralised revision control tool such as Subversion, there's | |
389 no way to make a choice over which model you'll use: the tool | |
390 gives you shared-push, and if you want to do anything else, | |
391 you'll have to roll your own approach on top (such as applying | |
392 a patch by hand).</para> | |
393 | |
394 <para>A good distributed revision control tool, such as | |
395 Mercurial, will support both models. You and your | |
396 collaborators can then structure how you work together based | |
397 on your own needs and preferences, not on what contortions | |
398 your tools force you into.</para> | |
399 | |
400 </sect2> | |
401 <sect2> | |
402 <title>Where collaboration meets branch management</title> | |
403 | |
404 <para>Once you and your team set up some shared repositories and | |
405 start propagating changes back and forth between local and | |
406 shared repos, you begin to face a related, but slightly | |
407 different challenge: that of managing the multiple directions | |
408 in which your team may be moving at once. Even though this | |
409 subject is intimately related to how your team collaborates, | |
410 it's dense enough to merit treatment of its own, in chapter | |
411 <xref linkend="chap:branch"/>.</para> | |
412 | |
413 </sect2> | |
414 </sect1> | |
415 <sect1> | |
416 <title>The technical side of sharing</title> | |
417 | |
418 <para>The remainder of this chapter is devoted to the question of | |
419 serving data to your collaborators.</para> | |
420 | |
421 </sect1> | |
422 <sect1 id="sec:collab:serve"> | |
423 <title>Informal sharing with <command role="hg-cmd">hg | |
424 serve</command></title> | |
425 | |
426 <para>Mercurial's <command role="hg-cmd">hg serve</command> | |
427 command is wonderfully suited to small, tight-knit, and | |
428 fast-paced group environments. It also provides a great way to | |
429 get a feel for using Mercurial commands over a network.</para> | |
430 | |
431 <para>Run <command role="hg-cmd">hg serve</command> inside a | |
432 repository, and in under a second it will bring up a specialised | |
433 HTTP server; this will accept connections from any client, and | |
434 serve up data for that repository until you terminate it. | |
435 Anyone who knows the URL of the server you just started, and can | |
436 talk to your computer over the network, can then use a web | |
437 browser or Mercurial to read data from that repository. A URL | |
438 for a <command role="hg-cmd">hg serve</command> instance running | |
439 on a laptop is likely to look something like | |
440 <literal>http://my-laptop.local:8000/</literal>.</para> | |
441 | |
442 <para>The <command role="hg-cmd">hg serve</command> command is | |
443 <emphasis>not</emphasis> a general-purpose web server. It can do | |
444 only two things:</para> | |
445 <itemizedlist> | |
446 <listitem><para>Allow people to browse the history of the | |
447 repository it's serving, from their normal web | |
448 browsers.</para> | |
449 </listitem> | |
450 <listitem><para>Speak Mercurial's wire protocol, so that people | |
451 can <command role="hg-cmd">hg clone</command> or <command | |
452 role="hg-cmd">hg pull</command> changes from that | |
453 repository.</para> | |
454 </listitem></itemizedlist> | |
455 <para>In particular, <command role="hg-cmd">hg serve</command> | |
456 won't allow remote users to <emphasis>modify</emphasis> your | |
457 repository. It's intended for read-only use.</para> | |
458 | |
459 <para>If you're getting started with Mercurial, there's nothing to | |
460 prevent you from using <command role="hg-cmd">hg serve</command> | |
461 to serve up a repository on your own computer, then use commands | |
462 like <command role="hg-cmd">hg clone</command>, <command | |
463 role="hg-cmd">hg incoming</command>, and so on to talk to that | |
464 server as if the repository was hosted remotely. This can help | |
465 you to quickly get acquainted with using commands on | |
466 network-hosted repositories.</para> | |
467 | |
468 <sect2> | |
469 <title>A few things to keep in mind</title> | |
470 | |
471 <para>Because it provides unauthenticated read access to all | |
472 clients, you should only use <command role="hg-cmd">hg | |
473 serve</command> in an environment where you either don't | |
474 care, or have complete control over, who can access your | |
475 network and pull data from your repository.</para> | |
476 | |
477 <para>The <command role="hg-cmd">hg serve</command> command | |
478 knows nothing about any firewall software you might have | |
479 installed on your system or network. It cannot detect or | |
480 control your firewall software. If other people are unable to | |
481 talk to a running <command role="hg-cmd">hg serve</command> | |
482 instance, the second thing you should do | |
483 (<emphasis>after</emphasis> you make sure that they're using | |
484 the correct URL) is check your firewall configuration.</para> | |
485 | |
486 <para>By default, <command role="hg-cmd">hg serve</command> | |
487 listens for incoming connections on port 8000. If another | |
488 process is already listening on the port you want to use, you | |
489 can specify a different port to listen on using the <option | |
490 role="hg-opt-serve">-p</option> option.</para> | |
491 | |
492 <para>Normally, when <command role="hg-cmd">hg serve</command> | |
493 starts, it prints no output, which can be a bit unnerving. If | |
494 you'd like to confirm that it is indeed running correctly, and | |
495 find out what URL you should send to your collaborators, start | |
496 it with the <option role="hg-opt-global">-v</option> | |
497 option.</para> | |
498 | |
499 </sect2> | |
500 </sect1> | |
501 <sect1 id="sec:collab:ssh"> | |
502 <title>Using the Secure Shell (ssh) protocol</title> | |
503 | |
504 <para>You can pull and push changes securely over a network | |
505 connection using the Secure Shell (<literal>ssh</literal>) | |
506 protocol. To use this successfully, you may have to do a little | |
507 bit of configuration on the client or server sides.</para> | |
508 | |
509 <para>If you're not familiar with ssh, it's a network protocol | |
510 that lets you securely communicate with another computer. To | |
511 use it with Mercurial, you'll be setting up one or more user | |
512 accounts on a server so that remote users can log in and execute | |
513 commands.</para> | |
514 | |
515 <para>(If you <emphasis>are</emphasis> familiar with ssh, you'll | |
516 probably find some of the material that follows to be elementary | |
517 in nature.)</para> | |
518 | |
519 <sect2> | |
520 <title>How to read and write ssh URLs</title> | |
521 | |
522 <para>An ssh URL tends to look like this:</para> | |
523 <programlisting>ssh://bos@hg.serpentine.com:22/hg/hgbook</programlisting> | |
524 <orderedlist> | |
525 <listitem><para>The <quote><literal>ssh://</literal></quote> | |
526 part tells Mercurial to use the ssh protocol.</para> | |
527 </listitem> | |
528 <listitem><para>The <quote><literal>bos@</literal></quote> | |
529 component indicates what username to log into the server | |
530 as. You can leave this out if the remote username is the | |
531 same as your local username.</para> | |
532 </listitem> | |
533 <listitem><para>The | |
534 <quote><literal>hg.serpentine.com</literal></quote> gives | |
535 the hostname of the server to log into.</para> | |
536 </listitem> | |
537 <listitem><para>The <quote>:22</quote> identifies the port | |
538 number to connect to the server on. The default port is | |
539 22, so you only need to specify a colon and port number if | |
540 you're <emphasis>not</emphasis> using port 22.</para> | |
541 </listitem> | |
542 <listitem><para>The remainder of the URL is the local path to | |
543 the repository on the server.</para> | |
544 </listitem></orderedlist> | |
545 | |
546 <para>There's plenty of scope for confusion with the path | |
547 component of ssh URLs, as there is no standard way for tools | |
548 to interpret it. Some programs behave differently than others | |
549 when dealing with these paths. This isn't an ideal situation, | |
550 but it's unlikely to change. Please read the following | |
551 paragraphs carefully.</para> | |
552 | |
553 <para>Mercurial treats the path to a repository on the server as | |
554 relative to the remote user's home directory. For example, if | |
555 user <literal>foo</literal> on the server has a home directory | |
556 of <filename class="directory">/home/foo</filename>, then an | |
557 ssh URL that contains a path component of <filename | |
558 class="directory">bar</filename> <emphasis>really</emphasis> | |
559 refers to the directory <filename | |
560 class="directory">/home/foo/bar</filename>.</para> | |
561 | |
562 <para>If you want to specify a path relative to another user's | |
563 home directory, you can use a path that starts with a tilde | |
564 character followed by the user's name (let's call them | |
565 <literal>otheruser</literal>), like this.</para> | |
566 <programlisting>ssh://server/~otheruser/hg/repo</programlisting> | |
567 | |
568 <para>And if you really want to specify an | |
569 <emphasis>absolute</emphasis> path on the server, begin the | |
570 path component with two slashes, as in this example.</para> | |
571 <programlisting>ssh://server//absolute/path</programlisting> | |
572 | |
573 </sect2> | |
574 <sect2> | |
575 <title>Finding an ssh client for your system</title> | |
576 | |
577 <para>Almost every Unix-like system comes with OpenSSH | |
578 preinstalled. If you're using such a system, run | |
579 <literal>which ssh</literal> to find out if the | |
580 <command>ssh</command> command is installed (it's usually in | |
581 <filename class="directory">/usr/bin</filename>). In the | |
582 unlikely event that it isn't present, take a look at your | |
583 system documentation to figure out how to install it.</para> | |
584 | |
585 <para>On Windows, you'll first need to download a suitable ssh | |
586 client. There are two alternatives.</para> | |
587 <itemizedlist> | |
588 <listitem><para>Simon Tatham's excellent PuTTY package | |
589 <citation>web:putty</citation> provides a complete suite | |
590 of ssh client commands.</para> | |
591 </listitem> | |
592 <listitem><para>If you have a high tolerance for pain, you can | |
593 use the Cygwin port of OpenSSH.</para> | |
594 </listitem></itemizedlist> | |
595 <para>In either case, you'll need to edit your <filename | |
596 role="special">hg.ini</filename> file to | |
597 tell Mercurial where to find the actual client command. For | |
598 example, if you're using PuTTY, you'll need to use the | |
599 <command>plink</command> command as a command-line ssh | |
600 client.</para> | |
601 <programlisting>[ui] | |
602 ssh = C:/path/to/plink.exe -ssh -i "C:/path/to/my/private/key"</programlisting> | |
603 | |
604 <note> | |
605 <para> The path to <command>plink</command> shouldn't contain | |
606 any whitespace characters, or Mercurial may not be able to | |
607 run it correctly (so putting it in <filename | |
608 class="directory">C:\Program Files</filename> is probably | |
609 not a good idea).</para> | |
610 </note> | |
611 | |
612 </sect2> | |
613 <sect2> | |
614 <title>Generating a key pair</title> | |
615 | |
616 <para>To avoid the need to repetitively type a password every | |
617 time you need to use your ssh client, I recommend generating a | |
618 key pair. On a Unix-like system, the | |
619 <command>ssh-keygen</command> command will do the trick. On | |
620 Windows, if you're using PuTTY, the | |
621 <command>puttygen</command> command is what you'll | |
622 need.</para> | |
623 | |
624 <para>When you generate a key pair, it's usually | |
625 <emphasis>highly</emphasis> advisable to protect it with a | |
626 passphrase. (The only time that you might not want to do this | |
627 is when you're using the ssh protocol for automated tasks on a | |
628 secure network.)</para> | |
629 | |
630 <para>Simply generating a key pair isn't enough, however. | |
631 You'll need to add the public key to the set of authorised | |
632 keys for whatever user you're logging in remotely as. For | |
633 servers using OpenSSH (the vast majority), this will mean | |
634 adding the public key to a list in a file called <filename | |
635 role="special">authorized_keys</filename> in their <filename | |
636 role="special" class="directory">.ssh</filename> | |
637 directory.</para> | |
638 | |
639 <para>On a Unix-like system, your public key will have a | |
640 <filename>.pub</filename> extension. If you're using | |
641 <command>puttygen</command> on Windows, you can save the | |
642 public key to a file of your choosing, or paste it from the | |
643 window it's displayed in straight into the <filename | |
644 role="special">authorized_keys</filename> file.</para> | |
645 | |
646 </sect2> | |
647 <sect2> | |
648 <title>Using an authentication agent</title> | |
649 | |
650 <para>An authentication agent is a daemon that stores | |
651 passphrases in memory (so it will forget passphrases if you | |
652 log out and log back in again). An ssh client will notice if | |
653 it's running, and query it for a passphrase. If there's no | |
654 authentication agent running, or the agent doesn't store the | |
655 necessary passphrase, you'll have to type your passphrase | |
656 every time Mercurial tries to communicate with a server on | |
657 your behalf (e.g. whenever you pull or push changes).</para> | |
658 | |
659 <para>The downside of storing passphrases in an agent is that | |
660 it's possible for a well-prepared attacker to recover the | |
661 plain text of your passphrases, in some cases even if your | |
662 system has been power-cycled. You should make your own | |
663 judgment as to whether this is an acceptable risk. It | |
664 certainly saves a lot of repeated typing.</para> | |
665 | |
666 <para>On Unix-like systems, the agent is called | |
667 <command>ssh-agent</command>, and it's often run automatically | |
668 for you when you log in. You'll need to use the | |
669 <command>ssh-add</command> command to add passphrases to the | |
670 agent's store. On Windows, if you're using PuTTY, the | |
671 <command>pageant</command> command acts as the agent. It adds | |
672 an icon to your system tray that will let you manage stored | |
673 passphrases.</para> | |
674 | |
675 </sect2> | |
676 <sect2> | |
677 <title>Configuring the server side properly</title> | |
678 | |
679 <para>Because ssh can be fiddly to set up if you're new to it, | |
680 there's a variety of things that can go wrong. Add Mercurial | |
681 on top, and there's plenty more scope for head-scratching. | |
682 Most of these potential problems occur on the server side, not | |
683 the client side. The good news is that once you've gotten a | |
684 configuration working, it will usually continue to work | |
685 indefinitely.</para> | |
686 | |
687 <para>Before you try using Mercurial to talk to an ssh server, | |
688 it's best to make sure that you can use the normal | |
689 <command>ssh</command> or <command>putty</command> command to | |
690 talk to the server first. If you run into problems with using | |
691 these commands directly, Mercurial surely won't work. Worse, | |
692 it will obscure the underlying problem. Any time you want to | |
693 debug ssh-related Mercurial problems, you should drop back to | |
694 making sure that plain ssh client commands work first, | |
695 <emphasis>before</emphasis> you worry about whether there's a | |
696 problem with Mercurial.</para> | |
697 | |
698 <para>The first thing to be sure of on the server side is that | |
699 you can actually log in from another machine at all. If you | |
700 can't use <command>ssh</command> or <command>putty</command> | |
701 to log in, the error message you get may give you a few hints | |
702 as to what's wrong. The most common problems are as | |
703 follows.</para> | |
704 <itemizedlist> | |
705 <listitem><para>If you get a <quote>connection refused</quote> | |
706 error, either there isn't an SSH daemon running on the | |
707 server at all, or it's inaccessible due to firewall | |
708 configuration.</para> | |
709 </listitem> | |
710 <listitem><para>If you get a <quote>no route to host</quote> | |
711 error, you either have an incorrect address for the server | |
712 or a seriously locked down firewall that won't admit its | |
713 existence at all.</para> | |
714 </listitem> | |
715 <listitem><para>If you get a <quote>permission denied</quote> | |
716 error, you may have mistyped the username on the server, | |
717 or you could have mistyped your key's passphrase or the | |
718 remote user's password.</para> | |
719 </listitem></itemizedlist> | |
720 <para>In summary, if you're having trouble talking to the | |
721 server's ssh daemon, first make sure that one is running at | |
722 all. On many systems it will be installed, but disabled, by | |
723 default. Once you're done with this step, you should then | |
724 check that the server's firewall is configured to allow | |
725 incoming connections on the port the ssh daemon is listening | |
726 on (usually 22). Don't worry about more exotic possibilities | |
727 for misconfiguration until you've checked these two | |
728 first.</para> | |
729 | |
730 <para>If you're using an authentication agent on the client side | |
731 to store passphrases for your keys, you ought to be able to | |
732 log into the server without being prompted for a passphrase or | |
733 a password. If you're prompted for a passphrase, there are a | |
734 few possible culprits.</para> | |
735 <itemizedlist> | |
736 <listitem><para>You might have forgotten to use | |
737 <command>ssh-add</command> or <command>pageant</command> | |
738 to store the passphrase.</para> | |
739 </listitem> | |
740 <listitem><para>You might have stored the passphrase for the | |
741 wrong key.</para> | |
742 </listitem></itemizedlist> | |
743 <para>If you're being prompted for the remote user's password, | |
744 there are another few possible problems to check.</para> | |
745 <itemizedlist> | |
746 <listitem><para>Either the user's home directory or their | |
747 <filename role="special" class="directory">.ssh</filename> | |
748 directory might have excessively liberal permissions. As | |
749 a result, the ssh daemon will not trust or read their | |
750 <filename role="special">authorized_keys</filename> file. | |
751 For example, a group-writable home or <filename | |
752 role="special" class="directory">.ssh</filename> | |
753 directory will often cause this symptom.</para> | |
754 </listitem> | |
755 <listitem><para>The user's <filename | |
756 role="special">authorized_keys</filename> file may have | |
757 a problem. If anyone other than the user owns or can write | |
758 to that file, the ssh daemon will not trust or read | |
759 it.</para> | |
760 </listitem></itemizedlist> | |
761 | |
762 <para>In the ideal world, you should be able to run the | |
763 following command successfully, and it should print exactly | |
764 one line of output, the current date and time.</para> | |
765 <programlisting>ssh myserver date</programlisting> | |
766 | |
767 <para>If, on your server, you have login scripts that print | |
768 banners or other junk even when running non-interactive | |
769 commands like this, you should fix them before you continue, | |
770 so that they only print output if they're run interactively. | |
771 Otherwise these banners will at least clutter up Mercurial's | |
772 output. Worse, they could potentially cause problems with | |
773 running Mercurial commands remotely. Mercurial makes tries to | |
774 detect and ignore banners in non-interactive | |
775 <command>ssh</command> sessions, but it is not foolproof. (If | |
776 you're editing your login scripts on your server, the usual | |
777 way to see if a login script is running in an interactive | |
778 shell is to check the return code from the command | |
779 <literal>tty -s</literal>.)</para> | |
780 | |
781 <para>Once you've verified that plain old ssh is working with | |
782 your server, the next step is to ensure that Mercurial runs on | |
783 the server. The following command should run | |
784 successfully:</para> | |
785 | |
786 <programlisting>ssh myserver hg version</programlisting> | |
787 | |
788 <para>If you see an error message instead of normal <command | |
789 role="hg-cmd">hg version</command> output, this is usually | |
790 because you haven't installed Mercurial to <filename | |
791 class="directory">/usr/bin</filename>. Don't worry if this | |
792 is the case; you don't need to do that. But you should check | |
793 for a few possible problems.</para> | |
794 <itemizedlist> | |
795 <listitem><para>Is Mercurial really installed on the server at | |
796 all? I know this sounds trivial, but it's worth | |
797 checking!</para> | |
798 </listitem> | |
799 <listitem><para>Maybe your shell's search path (usually set | |
800 via the <envar>PATH</envar> environment variable) is | |
801 simply misconfigured.</para> | |
802 </listitem> | |
803 <listitem><para>Perhaps your <envar>PATH</envar> environment | |
804 variable is only being set to point to the location of the | |
805 <command>hg</command> executable if the login session is | |
806 interactive. This can happen if you're setting the path | |
807 in the wrong shell login script. See your shell's | |
808 documentation for details.</para> | |
809 </listitem> | |
810 <listitem><para>The <envar>PYTHONPATH</envar> environment | |
811 variable may need to contain the path to the Mercurial | |
812 Python modules. It might not be set at all; it could be | |
813 incorrect; or it may be set only if the login is | |
814 interactive.</para> | |
815 </listitem></itemizedlist> | |
816 | |
817 <para>If you can run <command role="hg-cmd">hg version</command> | |
818 over an ssh connection, well done! You've got the server and | |
819 client sorted out. You should now be able to use Mercurial to | |
820 access repositories hosted by that username on that server. | |
821 If you run into problems with Mercurial and ssh at this point, | |
822 try using the <option role="hg-opt-global">--debug</option> | |
823 option to get a clearer picture of what's going on.</para> | |
824 | |
825 </sect2> | |
826 <sect2> | |
827 <title>Using compression with ssh</title> | |
828 | |
829 <para>Mercurial does not compress data when it uses the ssh | |
830 protocol, because the ssh protocol can transparently compress | |
831 data. However, the default behaviour of ssh clients is | |
832 <emphasis>not</emphasis> to request compression.</para> | |
833 | |
834 <para>Over any network other than a fast LAN (even a wireless | |
835 network), using compression is likely to significantly speed | |
836 up Mercurial's network operations. For example, over a WAN, | |
837 someone measured compression as reducing the amount of time | |
838 required to clone a particularly large repository from 51 | |
839 minutes to 17 minutes.</para> | |
840 | |
841 <para>Both <command>ssh</command> and <command>plink</command> | |
842 accept a <option role="cmd-opt-ssh">-C</option> option which | |
843 turns on compression. You can easily edit your <filename | |
844 role="special">~/.hgrc</filename> to enable compression for | |
845 all of Mercurial's uses of the ssh protocol.</para> | |
846 <programlisting>[ui] | |
847 ssh = ssh -C</programlisting> | |
848 | |
849 <para>If you use <command>ssh</command>, you can configure it to | |
850 always use compression when talking to your server. To do | |
851 this, edit your <filename | |
852 role="special">.ssh/config</filename> file (which may not | |
853 yet exist), as follows.</para> | |
854 <programlisting>Host hg | |
855 Compression yes | |
856 HostName hg.example.com</programlisting> | |
857 <para>This defines an alias, <literal>hg</literal>. When you | |
858 use it on the <command>ssh</command> command line or in a | |
859 Mercurial <literal>ssh</literal>-protocol URL, it will cause | |
860 <command>ssh</command> to connect to | |
861 <literal>hg.example.com</literal> and use compression. This | |
862 gives you both a shorter name to type and compression, each of | |
863 which is a good thing in its own right.</para> | |
864 | |
865 </sect2> | |
866 </sect1> | |
867 <sect1 id="sec:collab:cgi"> | |
868 <title>Serving over HTTP using CGI</title> | |
869 | |
870 <para>Depending on how ambitious you are, configuring Mercurial's | |
871 CGI interface can take anything from a few moments to several | |
872 hours.</para> | |
873 | |
874 <para>We'll begin with the simplest of examples, and work our way | |
875 towards a more complex configuration. Even for the most basic | |
876 case, you're almost certainly going to need to read and modify | |
877 your web server's configuration.</para> | |
878 | |
879 <note> | |
880 <para> Configuring a web server is a complex, fiddly, and | |
881 highly system-dependent activity. I can't possibly give you | |
882 instructions that will cover anything like all of the cases | |
883 you will encounter. Please use your discretion and judgment in | |
884 following the sections below. Be prepared to make plenty of | |
885 mistakes, and to spend a lot of time reading your server's | |
886 error logs.</para> | |
887 </note> | |
888 | |
889 <sect2> | |
890 <title>Web server configuration checklist</title> | |
891 | |
892 <para>Before you continue, do take a few moments to check a few | |
893 aspects of your system's setup.</para> | |
894 | |
895 <orderedlist> | |
896 <listitem><para>Do you have a web server installed at all? | |
897 Mac OS X ships with Apache, but many other systems may not | |
898 have a web server installed.</para> | |
899 </listitem> | |
900 <listitem><para>If you have a web server installed, is it | |
901 actually running? On most systems, even if one is | |
902 present, it will be disabled by default.</para> | |
903 </listitem> | |
904 <listitem><para>Is your server configured to allow you to run | |
905 CGI programs in the directory where you plan to do so? | |
906 Most servers default to explicitly disabling the ability | |
907 to run CGI programs.</para> | |
908 </listitem></orderedlist> | |
909 | |
910 <para>If you don't have a web server installed, and don't have | |
911 substantial experience configuring Apache, you should consider | |
912 using the <literal>lighttpd</literal> web server instead of | |
913 Apache. Apache has a well-deserved reputation for baroque and | |
914 confusing configuration. While <literal>lighttpd</literal> is | |
915 less capable in some ways than Apache, most of these | |
916 capabilities are not relevant to serving Mercurial | |
917 repositories. And <literal>lighttpd</literal> is undeniably | |
918 <emphasis>much</emphasis> easier to get started with than | |
919 Apache.</para> | |
920 | |
921 </sect2> | |
922 <sect2> | |
923 <title>Basic CGI configuration</title> | |
924 | |
925 <para>On Unix-like systems, it's common for users to have a | |
926 subdirectory named something like <filename | |
927 class="directory">public_html</filename> in their home | |
928 directory, from which they can serve up web pages. A file | |
929 named <filename>foo</filename> in this directory will be | |
930 accessible at a URL of the form | |
931 <literal>http://www.example.com/username/foo</literal>.</para> | |
932 | |
933 <para>To get started, find the <filename | |
934 role="special">hgweb.cgi</filename> script that should be | |
935 present in your Mercurial installation. If you can't quickly | |
936 find a local copy on your system, simply download one from the | |
937 master Mercurial repository at <ulink | |
938 url="http://www.selenic.com/repo/hg/raw-file/tip/hgweb.cgi">http://www.selenic.com/repo/hg/raw-file/tip/hgweb.cgi</ulink>.</para> | |
939 | |
940 <para>You'll need to copy this script into your <filename | |
941 class="directory">public_html</filename> directory, and | |
942 ensure that it's executable.</para> | |
943 <programlisting>cp .../hgweb.cgi ~/public_html | |
944 chmod 755 ~/public_html/hgweb.cgi</programlisting> | |
945 <para>The <literal>755</literal> argument to | |
946 <command>chmod</command> is a little more general than just | |
947 making the script executable: it ensures that the script is | |
948 executable by anyone, and that <quote>group</quote> and | |
949 <quote>other</quote> write permissions are | |
950 <emphasis>not</emphasis> set. If you were to leave those | |
951 write permissions enabled, Apache's <literal>suexec</literal> | |
952 subsystem would likely refuse to execute the script. In fact, | |
953 <literal>suexec</literal> also insists that the | |
954 <emphasis>directory</emphasis> in which the script resides | |
955 must not be writable by others.</para> | |
956 <programlisting>chmod 755 ~/public_html</programlisting> | |
957 | |
958 <sect3 id="sec:collab:wtf"> | |
959 <title>What could <emphasis>possibly</emphasis> go | |
960 wrong?</title> | |
961 | |
962 <para>Once you've copied the CGI script into place, go into a | |
963 web browser, and try to open the URL <ulink | |
964 url="http://myhostname/ | |
965 myuser/hgweb.cgi">http://myhostname/ | |
966 myuser/hgweb.cgi</ulink>, <emphasis>but</emphasis> brace | |
967 yourself for instant failure. There's a high probability | |
968 that trying to visit this URL will fail, and there are many | |
969 possible reasons for this. In fact, you're likely to | |
970 stumble over almost every one of the possible errors below, | |
971 so please read carefully. The following are all of the | |
972 problems I ran into on a system running Fedora 7, with a | |
973 fresh installation of Apache, and a user account that I | |
974 created specially to perform this exercise.</para> | |
975 | |
976 <para>Your web server may have per-user directories disabled. | |
977 If you're using Apache, search your config file for a | |
978 <literal>UserDir</literal> directive. If there's none | |
979 present, per-user directories will be disabled. If one | |
980 exists, but its value is <literal>disabled</literal>, then | |
981 per-user directories will be disabled. Otherwise, the | |
982 string after <literal>UserDir</literal> gives the name of | |
983 the subdirectory that Apache will look in under your home | |
984 directory, for example <filename | |
985 class="directory">public_html</filename>.</para> | |
986 | |
987 <para>Your file access permissions may be too restrictive. | |
988 The web server must be able to traverse your home directory | |
989 and directories under your <filename | |
990 class="directory">public_html</filename> directory, and | |
991 read files under the latter too. Here's a quick recipe to | |
992 help you to make your permissions more appropriate.</para> | |
993 <programlisting>chmod 755 ~ | |
994 find ~/public_html -type d -print0 | xargs -0r chmod 755 | |
995 find ~/public_html -type f -print0 | xargs -0r chmod 644</programlisting> | |
996 | |
997 <para>The other possibility with permissions is that you might | |
998 get a completely empty window when you try to load the | |
999 script. In this case, it's likely that your access | |
1000 permissions are <emphasis>too permissive</emphasis>. Apache's | |
1001 <literal>suexec</literal> subsystem won't execute a script | |
1002 that's group- or world-writable, for example.</para> | |
1003 | |
1004 <para>Your web server may be configured to disallow execution | |
1005 of CGI programs in your per-user web directory. Here's | |
1006 Apache's default per-user configuration from my Fedora | |
1007 system.</para> | |
1008 | |
1009 &ch06-apache-config.lst; | |
1010 | |
1011 <para>If you find a similar-looking | |
1012 <literal>Directory</literal> group in your Apache | |
1013 configuration, the directive to look at inside it is | |
1014 <literal>Options</literal>. Add <literal>ExecCGI</literal> | |
1015 to the end of this list if it's missing, and restart the web | |
1016 server.</para> | |
1017 | |
1018 <para>If you find that Apache serves you the text of the CGI | |
1019 script instead of executing it, you may need to either | |
1020 uncomment (if already present) or add a directive like | |
1021 this.</para> | |
1022 <programlisting>AddHandler cgi-script .cgi</programlisting> | |
1023 | |
1024 <para>The next possibility is that you might be served with a | |
1025 colourful Python backtrace claiming that it can't import a | |
1026 <literal>mercurial</literal>-related module. This is | |
1027 actually progress! The server is now capable of executing | |
1028 your CGI script. This error is only likely to occur if | |
1029 you're running a private installation of Mercurial, instead | |
1030 of a system-wide version. Remember that the web server runs | |
1031 the CGI program without any of the environment variables | |
1032 that you take for granted in an interactive session. If | |
1033 this error happens to you, edit your copy of <filename | |
1034 role="special">hgweb.cgi</filename> and follow the | |
1035 directions inside it to correctly set your | |
1036 <envar>PYTHONPATH</envar> environment variable.</para> | |
1037 | |
1038 <para>Finally, you are <emphasis>certain</emphasis> to by | |
1039 served with another colourful Python backtrace: this one | |
1040 will complain that it can't find <filename | |
1041 class="directory">/path/to/repository</filename>. Edit | |
1042 your <filename role="special">hgweb.cgi</filename> script | |
1043 and replace the <filename | |
1044 class="directory">/path/to/repository</filename> string | |
1045 with the complete path to the repository you want to serve | |
1046 up.</para> | |
1047 | |
1048 <para>At this point, when you try to reload the page, you | |
1049 should be presented with a nice HTML view of your | |
1050 repository's history. Whew!</para> | |
1051 | |
1052 </sect3> | |
1053 <sect3> | |
1054 <title>Configuring lighttpd</title> | |
1055 | |
1056 <para>To be exhaustive in my experiments, I tried configuring | |
1057 the increasingly popular <literal>lighttpd</literal> web | |
1058 server to serve the same repository as I described with | |
1059 Apache above. I had already overcome all of the problems I | |
1060 outlined with Apache, many of which are not server-specific. | |
1061 As a result, I was fairly sure that my file and directory | |
1062 permissions were good, and that my <filename | |
1063 role="special">hgweb.cgi</filename> script was properly | |
1064 edited.</para> | |
1065 | |
1066 <para>Once I had Apache running, getting | |
1067 <literal>lighttpd</literal> to serve the repository was a | |
1068 snap (in other words, even if you're trying to use | |
1069 <literal>lighttpd</literal>, you should read the Apache | |
1070 section). I first had to edit the | |
1071 <literal>mod_access</literal> section of its config file to | |
1072 enable <literal>mod_cgi</literal> and | |
1073 <literal>mod_userdir</literal>, both of which were disabled | |
1074 by default on my system. I then added a few lines to the | |
1075 end of the config file, to configure these modules.</para> | |
1076 <programlisting>userdir.path = "public_html" | |
1077 cgi.assign = (".cgi" => "" )</programlisting> | |
1078 <para>With this done, <literal>lighttpd</literal> ran | |
1079 immediately for me. If I had configured | |
1080 <literal>lighttpd</literal> before Apache, I'd almost | |
1081 certainly have run into many of the same system-level | |
1082 configuration problems as I did with Apache. However, I | |
1083 found <literal>lighttpd</literal> to be noticeably easier to | |
1084 configure than Apache, even though I've used Apache for over | |
1085 a decade, and this was my first exposure to | |
1086 <literal>lighttpd</literal>.</para> | |
1087 | |
1088 </sect3> | |
1089 </sect2> | |
1090 <sect2> | |
1091 <title>Sharing multiple repositories with one CGI script</title> | |
1092 | |
1093 <para>The <filename role="special">hgweb.cgi</filename> script | |
1094 only lets you publish a single repository, which is an | |
1095 annoying restriction. If you want to publish more than one | |
1096 without wracking yourself with multiple copies of the same | |
1097 script, each with different names, a better choice is to use | |
1098 the <filename role="special">hgwebdir.cgi</filename> | |
1099 script.</para> | |
1100 | |
1101 <para>The procedure to configure <filename | |
1102 role="special">hgwebdir.cgi</filename> is only a little more | |
1103 involved than for <filename | |
1104 role="special">hgweb.cgi</filename>. First, you must obtain | |
1105 a copy of the script. If you don't have one handy, you can | |
1106 download a copy from the master Mercurial repository at <ulink | |
1107 url="http://www.selenic.com/repo/hg/raw-file/tip/hgwebdir.cgi">http://www.selenic.com/repo/hg/raw-file/tip/hgwebdir.cgi</ulink>.</para> | |
1108 | |
1109 <para>You'll need to copy this script into your <filename | |
1110 class="directory">public_html</filename> directory, and | |
1111 ensure that it's executable.</para> | |
1112 <programlisting>cp .../hgwebdir.cgi ~/public_html | |
1113 chmod 755 ~/public_html ~/public_html/hgwebdir.cgi</programlisting> | |
1114 <para>With basic configuration out of the way, try to visit | |
1115 <ulink url="http://myhostname/ | |
1116 myuser/hgwebdir.cgi">http://myhostname/ | |
1117 myuser/hgwebdir.cgi</ulink> in your browser. It should | |
1118 display an empty list of repositories. If you get a blank | |
1119 window or error message, try walking through the list of | |
1120 potential problems in section <xref | |
1121 linkend="sec:collab:wtf"/>.</para> | |
1122 | |
1123 <para>The <filename role="special">hgwebdir.cgi</filename> | |
1124 script relies on an external configuration file. By default, | |
1125 it searches for a file named <filename | |
1126 role="special">hgweb.config</filename> in the same directory | |
1127 as itself. You'll need to create this file, and make it | |
1128 world-readable. The format of the file is similar to a | |
1129 Windows <quote>ini</quote> file, as understood by Python's | |
1130 <literal>ConfigParser</literal> | |
1131 <citation>web:configparser</citation> module.</para> | |
1132 | |
1133 <para>The easiest way to configure <filename | |
1134 role="special">hgwebdir.cgi</filename> is with a section | |
1135 named <literal>collections</literal>. This will automatically | |
1136 publish <emphasis>every</emphasis> repository under the | |
1137 directories you name. The section should look like | |
1138 this:</para> | |
1139 <programlisting>[collections] | |
1140 /my/root = /my/root</programlisting> | |
1141 <para>Mercurial interprets this by looking at the directory name | |
1142 on the <emphasis>right</emphasis> hand side of the | |
1143 <quote><literal>=</literal></quote> sign; finding repositories | |
1144 in that directory hierarchy; and using the text on the | |
1145 <emphasis>left</emphasis> to strip off matching text from the | |
1146 names it will actually list in the web interface. The | |
1147 remaining component of a path after this stripping has | |
1148 occurred is called a <quote>virtual path</quote>.</para> | |
1149 | |
1150 <para>Given the example above, if we have a repository whose | |
1151 local path is <filename | |
1152 class="directory">/my/root/this/repo</filename>, the CGI | |
1153 script will strip the leading <filename | |
1154 class="directory">/my/root</filename> from the name, and | |
1155 publish the repository with a virtual path of <filename | |
1156 class="directory">this/repo</filename>. If the base URL for | |
1157 our CGI script is <ulink url="http://myhostname/ | |
1158 myuser/hgwebdir.cgi">http://myhostname/ | |
1159 myuser/hgwebdir.cgi</ulink>, the complete URL for that | |
1160 repository will be <ulink url="http://myhostname/ | |
1161 myuser/hgwebdir.cgi/this/repo">http://myhostname/ | |
1162 myuser/hgwebdir.cgi/this/repo</ulink>.</para> | |
1163 | |
1164 <para>If we replace <filename | |
1165 class="directory">/my/root</filename> on the left hand side | |
1166 of this example with <filename | |
1167 class="directory">/my</filename>, then <filename | |
1168 role="special">hgwebdir.cgi</filename> will only strip off | |
1169 <filename class="directory">/my</filename> from the repository | |
1170 name, and will give us a virtual path of <filename | |
1171 class="directory">root/this/repo</filename> instead of | |
1172 <filename class="directory">this/repo</filename>.</para> | |
1173 | |
1174 <para>The <filename role="special">hgwebdir.cgi</filename> | |
1175 script will recursively search each directory listed in the | |
1176 <literal>collections</literal> section of its configuration | |
1177 file, but it will <literal>not</literal> recurse into the | |
1178 repositories it finds.</para> | |
1179 | |
1180 <para>The <literal>collections</literal> mechanism makes it easy | |
1181 to publish many repositories in a <quote>fire and | |
1182 forget</quote> manner. You only need to set up the CGI | |
1183 script and configuration file one time. Afterwards, you can | |
1184 publish or unpublish a repository at any time by simply moving | |
1185 it into, or out of, the directory hierarchy in which you've | |
1186 configured <filename role="special">hgwebdir.cgi</filename> to | |
1187 look.</para> | |
1188 | |
1189 <sect3> | |
1190 <title>Explicitly specifying which repositories to | |
1191 publish</title> | |
1192 | |
1193 <para>In addition to the <literal>collections</literal> | |
1194 mechanism, the <filename | |
1195 role="special">hgwebdir.cgi</filename> script allows you | |
1196 to publish a specific list of repositories. To do so, | |
1197 create a <literal>paths</literal> section, with contents of | |
1198 the following form.</para> | |
1199 <programlisting>[paths] | |
1200 repo1 = /my/path/to/some/repo | |
1201 repo2 = /some/path/to/another</programlisting> | |
1202 <para>In this case, the virtual path (the component that will | |
1203 appear in a URL) is on the left hand side of each | |
1204 definition, while the path to the repository is on the | |
1205 right. Notice that there does not need to be any | |
1206 relationship between the virtual path you choose and the | |
1207 location of a repository in your filesystem.</para> | |
1208 | |
1209 <para>If you wish, you can use both the | |
1210 <literal>collections</literal> and <literal>paths</literal> | |
1211 mechanisms simultaneously in a single configuration | |
1212 file.</para> | |
1213 | |
1214 <note> | |
1215 <para> If multiple repositories have the same virtual path, | |
1216 <filename role="special">hgwebdir.cgi</filename> will not | |
1217 report an error. Instead, it will behave | |
1218 unpredictably.</para> | |
1219 </note> | |
1220 | |
1221 </sect3> | |
1222 </sect2> | |
1223 <sect2> | |
1224 <title>Downloading source archives</title> | |
1225 | |
1226 <para>Mercurial's web interface lets users download an archive | |
1227 of any revision. This archive will contain a snapshot of the | |
1228 working directory as of that revision, but it will not contain | |
1229 a copy of the repository data.</para> | |
1230 | |
1231 <para>By default, this feature is not enabled. To enable it, | |
1232 you'll need to add an <envar | |
1233 role="rc-item-web">allow_archive</envar> item to the | |
1234 <literal role="rc-web">web</literal> section of your <filename | |
1235 role="special">~/.hgrc</filename>.</para> | |
1236 | |
1237 </sect2> | |
1238 <sect2> | |
1239 <title>Web configuration options</title> | |
1240 | |
1241 <para>Mercurial's web interfaces (the <command role="hg-cmd">hg | |
1242 serve</command> command, and the <filename | |
1243 role="special">hgweb.cgi</filename> and <filename | |
1244 role="special">hgwebdir.cgi</filename> scripts) have a | |
1245 number of configuration options that you can set. These | |
1246 belong in a section named <literal | |
1247 role="rc-web">web</literal>.</para> | |
1248 <itemizedlist> | |
1249 <listitem><para><envar | |
1250 role="rc-item-web">allow_archive</envar>: Determines | |
1251 which (if any) archive download mechanisms Mercurial | |
1252 supports. If you enable this feature, users of the web | |
1253 interface will be able to download an archive of whatever | |
1254 revision of a repository they are viewing. To enable the | |
1255 archive feature, this item must take the form of a | |
1256 sequence of words drawn from the list below.</para> | |
1257 <itemizedlist> | |
1258 <listitem><para><literal>bz2</literal>: A | |
1259 <command>tar</command> archive, compressed using | |
1260 <literal>bzip2</literal> compression. This has the | |
1261 best compression ratio, but uses the most CPU time on | |
1262 the server.</para> | |
1263 </listitem> | |
1264 <listitem><para><literal>gz</literal>: A | |
1265 <command>tar</command> archive, compressed using | |
1266 <literal>gzip</literal> compression.</para> | |
1267 </listitem> | |
1268 <listitem><para><literal>zip</literal>: A | |
1269 <command>zip</command> archive, compressed using LZW | |
1270 compression. This format has the worst compression | |
1271 ratio, but is widely used in the Windows world.</para> | |
1272 </listitem> | |
1273 </itemizedlist> | |
1274 <para> If you provide an empty list, or don't have an | |
1275 <envar role="rc-item-web">allow_archive</envar> entry at | |
1276 all, this feature will be disabled. Here is an example of | |
1277 how to enable all three supported formats.</para> | |
1278 <programlisting>[web] | |
1279 allow_archive = bz2 gz zip</programlisting> | |
1280 </listitem> | |
1281 <listitem><para><envar role="rc-item-web">allowpull</envar>: | |
1282 Boolean. Determines whether the web interface allows | |
1283 remote users to <command role="hg-cmd">hg pull</command> | |
1284 and <command role="hg-cmd">hg clone</command> this | |
1285 repository over HTTP. If set to <literal>no</literal> or | |
1286 <literal>false</literal>, only the | |
1287 <quote>human-oriented</quote> portion of the web interface | |
1288 is available.</para> | |
1289 </listitem> | |
1290 <listitem><para><envar role="rc-item-web">contact</envar>: | |
1291 String. A free-form (but preferably brief) string | |
1292 identifying the person or group in charge of the | |
1293 repository. This often contains the name and email | |
1294 address of a person or mailing list. It often makes sense | |
1295 to place this entry in a repository's own <filename | |
1296 role="special">.hg/hgrc</filename> file, but it can make | |
1297 sense to use in a global <filename | |
1298 role="special">~/.hgrc</filename> if every repository | |
1299 has a single maintainer.</para> | |
1300 </listitem> | |
1301 <listitem><para><envar role="rc-item-web">maxchanges</envar>: | |
1302 Integer. The default maximum number of changesets to | |
1303 display in a single page of output.</para> | |
1304 </listitem> | |
1305 <listitem><para><envar role="rc-item-web">maxfiles</envar>: | |
1306 Integer. The default maximum number of modified files to | |
1307 display in a single page of output.</para> | |
1308 </listitem> | |
1309 <listitem><para><envar role="rc-item-web">stripes</envar>: | |
1310 Integer. If the web interface displays alternating | |
1311 <quote>stripes</quote> to make it easier to visually align | |
1312 rows when you are looking at a table, this number controls | |
1313 the number of rows in each stripe.</para> | |
1314 </listitem> | |
1315 <listitem><para><envar role="rc-item-web">style</envar>: | |
1316 Controls the template Mercurial uses to display the web | |
1317 interface. Mercurial ships with two web templates, named | |
1318 <literal>default</literal> and <literal>gitweb</literal> | |
1319 (the latter is much more visually attractive). You can | |
1320 also specify a custom template of your own; see chapter | |
1321 <xref linkend="chap:template"/> for details. | |
1322 Here, you can see how to enable the | |
1323 <literal>gitweb</literal> style.</para> | |
1324 <programlisting>[web] | |
1325 style = gitweb</programlisting> | |
1326 </listitem> | |
1327 <listitem><para><envar role="rc-item-web">templates</envar>: | |
1328 Path. The directory in which to search for template | |
1329 files. By default, Mercurial searches in the directory in | |
1330 which it was installed.</para> | |
1331 </listitem></itemizedlist> | |
1332 <para>If you are using <filename | |
1333 role="special">hgwebdir.cgi</filename>, you can place a few | |
1334 configuration items in a <literal role="rc-web">web</literal> | |
1335 section of the <filename | |
1336 role="special">hgweb.config</filename> file instead of a | |
1337 <filename role="special">~/.hgrc</filename> file, for | |
1338 convenience. These items are <envar | |
1339 role="rc-item-web">motd</envar> and <envar | |
1340 role="rc-item-web">style</envar>.</para> | |
1341 | |
1342 <sect3> | |
1343 <title>Options specific to an individual repository</title> | |
1344 | |
1345 <para>A few <literal role="rc-web">web</literal> configuration | |
1346 items ought to be placed in a repository's local <filename | |
1347 role="special">.hg/hgrc</filename>, rather than a user's | |
1348 or global <filename role="special">~/.hgrc</filename>.</para> | |
1349 <itemizedlist> | |
1350 <listitem><para><envar | |
1351 role="rc-item-web">description</envar>: String. A | |
1352 free-form (but preferably brief) string that describes | |
1353 the contents or purpose of the repository.</para> | |
1354 </listitem> | |
1355 <listitem><para><envar role="rc-item-web">name</envar>: | |
1356 String. The name to use for the repository in the web | |
1357 interface. This overrides the default name, which is | |
1358 the last component of the repository's path.</para> | |
1359 </listitem></itemizedlist> | |
1360 | |
1361 </sect3> | |
1362 <sect3> | |
1363 <title>Options specific to the <command role="hg-cmd">hg | |
1364 serve</command> command</title> | |
1365 | |
1366 <para>Some of the items in the <literal | |
1367 role="rc-web">web</literal> section of a <filename | |
1368 role="special">~/.hgrc</filename> file are only for use | |
1369 with the <command role="hg-cmd">hg serve</command> | |
1370 command.</para> | |
1371 <itemizedlist> | |
1372 <listitem><para><envar role="rc-item-web">accesslog</envar>: | |
1373 Path. The name of a file into which to write an access | |
1374 log. By default, the <command role="hg-cmd">hg | |
1375 serve</command> command writes this information to | |
1376 standard output, not to a file. Log entries are written | |
1377 in the standard <quote>combined</quote> file format used | |
1378 by almost all web servers.</para> | |
1379 </listitem> | |
1380 <listitem><para><envar role="rc-item-web">address</envar>: | |
1381 String. The local address on which the server should | |
1382 listen for incoming connections. By default, the server | |
1383 listens on all addresses.</para> | |
1384 </listitem> | |
1385 <listitem><para><envar role="rc-item-web">errorlog</envar>: | |
1386 Path. The name of a file into which to write an error | |
1387 log. By default, the <command role="hg-cmd">hg | |
1388 serve</command> command writes this information to | |
1389 standard error, not to a file.</para> | |
1390 </listitem> | |
1391 <listitem><para><envar role="rc-item-web">ipv6</envar>: | |
1392 Boolean. Whether to use the IPv6 protocol. By default, | |
1393 IPv6 is not used.</para> | |
1394 </listitem> | |
1395 <listitem><para><envar role="rc-item-web">port</envar>: | |
1396 Integer. The TCP port number on which the server should | |
1397 listen. The default port number used is 8000.</para> | |
1398 </listitem></itemizedlist> | |
1399 | |
1400 </sect3> | |
1401 <sect3> | |
1402 <title>Choosing the right <filename | |
1403 role="special">~/.hgrc</filename> file to add <literal | |
1404 role="rc-web">web</literal> items to</title> | |
1405 | |
1406 <para>It is important to remember that a web server like | |
1407 Apache or <literal>lighttpd</literal> will run under a user | |
1408 ID that is different to yours. CGI scripts run by your | |
1409 server, such as <filename | |
1410 role="special">hgweb.cgi</filename>, will usually also run | |
1411 under that user ID.</para> | |
1412 | |
1413 <para>If you add <literal role="rc-web">web</literal> items to | |
1414 your own personal <filename role="special">~/.hgrc</filename> file, CGI scripts won't read that | |
1415 <filename role="special">~/.hgrc</filename> file. Those | |
1416 settings will thus only affect the behaviour of the <command | |
1417 role="hg-cmd">hg serve</command> command when you run it. | |
1418 To cause CGI scripts to see your settings, either create a | |
1419 <filename role="special">~/.hgrc</filename> file in the | |
1420 home directory of the user ID that runs your web server, or | |
1421 add those settings to a system-wide <filename | |
1422 role="special">~/.hgrc</filename> file.</para> | |
1423 | |
1424 | |
1425 </sect3> | |
1426 </sect2> | |
1427 </sect1> | |
1428 </chapter> | |
1429 | |
1430 <!-- | |
1431 local variables: | |
1432 sgml-parent-document: ("00book.xml" "book" "chapter") | |
1433 end: | |
1434 --> |