Proxy server
A proxy server is a computer that offers a computer network service to allow clients to make indirect network connections to other network services. A client connects to the proxy server, then requests a connection, file, or other resource available on a different server. The proxy provides the resource either by connecting to the specified server or by serving it from a cache. In some cases, the proxy may alter the client's request or the server's response for various purposes.
A proxy server can also serve as a firewall.
Web proxies
A common proxy application is a caching Web proxy. This provides a nearby cache of Web pages and files available on remote Web servers, allowing local network clients to access them more quickly or reliably.
When it receives a request for a Web resource (specified by a URL), a caching proxy looks for the resulting URL in its local cache. If found, it returns the document immediately. Otherwise it fetches it from the remote server, returns it to the requester and saves a copy in the cache. The cache usually uses an expiry algorithm to remove documents from the cache, according to their age, size, and access history. Two simple cache algorithms are Least Recently Used (LRU) and Least Frequently Used (LFU). LRU removes the documents that have been left the longest, while LFU removes the least popular documents.
Web proxies can also filter the content of Web pages served. Some censorware applications — which attempt to block offensive Web content — are implemented as Web proxies. Other web proxies reformat web pages for a specific purpose or audience; for example, Skweezer reformats web pages for cell phones and PDAs. Network operators can also deploy proxies to intercept computer viruses and other hostile content served from remote Web pages.
A special case of web proxies are "CGI proxies." These are web sites which allow a user to access a site through them. They generally use PHP or CGI to implement the proxying functionality. CGI proxies are frequently used to gain access to web sites blocked by corporate or school proxies. Since they also hide the user's own IP address from the web sites they access through the proxy, they are sometimes also used to gain a degree of anonymity.
Proxy transparency
Many organizations — including corporations, schools, and families — use proxy servers to enforce network use policies (see censorware) or provide security and caching services. Usually, the web proxy is not transparent to the client application: it must be configured to use the proxy, manually or with a configuration script. Thus, the user can evade the proxy by simply resetting the client configuration, except in the case where the proxy is used instead of a NAT router to share an internet connection or a LAN. Such proxies may be difficult to configure for applications requiring a large port range out going and may only be able to route inward to a single server for a given UDP or TCP port (see Wingate 2.x versions). However such proxies may have more extensive logging or more customizable security than a simple NAT router box.
A transparent proxy or transproxy (also known as a forced proxy) combines a proxy server with NAT so that connections are routed into the proxy without client-side configuration. However, RFC 3040 defines this type as intercepting proxy.
Both NAT and transproxies are somewhat controversial in the Internet technical community, since both violate the end-to-end principle upon which TCP/IP was designed.
The term proxy is also used in a different sense in the Session Initiation Protocol (SIP) used in many modern voice over IP systems. A SIP Proxy, unlike a Web proxy, does not handle the content of client data.
Open proxies, abuse, and detection
An open proxy is a proxy server which will accept client connections from any IP address and make connections to any Internet resource. Abuse of open proxies is currently implicated in a significant portion of e-mail spam delivery. Spammers frequently install open proxies on unwitting end users' Microsoft Windows computers by means of computer viruses designed for this purpose. Internet Relay Chat (IRC) abusers also frequently use open proxies to cloak their identities.
Because proxies could be implicated in abuse, system administrators have developed a number of ways to refuse service to open proxies. IRC networks such as the Blitzed network automatically test client systems for known types of open proxy [1]. Likewise, an email server may be configured to automatically test e-mail senders for open proxies, using software such as Michael Tokarev's proxycheck
[2].
Groups of IRC and electronic mail operators run DNSBLs publishing lists of the IP addresses of known open proxies, such as AHBL, CBL [3], NJABL [4], and SORBS.
The ethics of automatically testing clients for open proxies are controversial. Some experts, such as Vernon Schryver, consider such testing to be equivalent to an attacker portscanning the client host. [5] Others consider the client to have solicited the scan by connecting to a server whose terms of service include testing.
Reverse proxies
A reverse proxy is a proxy server that is installed in the neighborhood of one or more webservers. All traffic coming from the Internet and with a destination of one of the webservers is going through the proxy server. There are several reasons for installing reverse proxy servers:
- Security: the proxy server is an additional layer of defense and therefore protects the webservers further up the chain
- Encryption / SSL acceleration: when secure websites are created, the SSL encryption is often not done by the webserver itself, but by a reverse proxy that is equipped with SSL acceleration hardware. See Secure Sockets Layer.
- Load distribution: the reverse proxy can distribute the load to several webservers, each webserver serving its own application area. In such a case, the reverse proxy may need to rewrite the URLs in each webpage (translation from externally known URLs to the internal locations)
- Serve/cache static content: A reverse proxy can offload the webservers by caching static content like pictures and other static graphical content (See Squid cache)
- Compression: the proxy server can optimize and compress the content to speed up the load time.
Split proxies
A split proxy is essentially a pair of proxies installed across two computers. Since they are effectively two parts of the same program, they can communicate with each other in a more efficient way than they can communicate with a more standard resource or tool such as a website or browser. This is ideal for compressing data over a slow link, such as a wireless or mobile data service and also for reducing the issues regarding high latency links (such as satellite internet) where estabilishing a TCP connection is time consuming. Taking the example of web browsing, the user's browser is pointed to a local proxy which then communicates with its other half at some remote location. This remote server fetches the requisite data, repackages it and sends it back to the user's local proxy, which unpacks the data and presents it to the browser in the standard fashion .
Anonymous proxy risks
In using a proxy server (for example, anonymizing HTTP proxy), all data sent to the service being used (for example, HTTP server in a website) must pass through the proxy server before being sent to the service, mostly in unencrypted form. It is therefore possible, and has been demonstrated (see, for example, Sugarcane) for a malicious proxy server to record everything sent to the proxy: including unencrypted logins and passwords.
By chaining proxies which do not reveal data about the original requestor, it is possible to obfuscate activities from the eyes of the user's destination. However, more traces will be left on the intermediate hops, which could be used or offered up to trace the user's activities. If the policies and administrators of these other proxies are unknown, the user may fall victim to a false sense of security just because those details are out of sight and mind.
The bottom line of this is to be wary when using proxy servers, and only use proxy servers of known integrity (e.g., the owner is known and trusted, has a clear privacy policy, etc.), and never use proxy servers of unknown integrity. If there is no choice but to use unknown proxy servers, do not pass any private information (unless it is properly encrypted) through the proxy.
See also
- Anonymity and Internet privacy
- SOCKS
- Proxy auto-config (PAC)
- Web Proxy Autodiscovery Protocol (WPAD)
- Skweezer
- Firewall (networking)
- Circumventor
Popular proxy server software
- The Squid cache is a popular HTTP proxy server in the UNIX/Linux world.
- The Apache HTTP Server can be configured to act as a proxy server.
- Wingate is a multi protocol proxy server and NAT solution that can be used to redirect any kind of traffic on a Microsoft Windows host. It also provides firewall and mail server functionality.
- Privoxy is a free, open source web proxy with privacy features
- Microsoft Internet Security and Acceleration Server is a product that runs on Windows 2000/2003 servers and combines the functions of both a proxy server and a firewall.
- Tor - A proxy-based anonymyzing Internet communication system
- Proxomitron - User-configurable web proxy used to re-write webpages on the fly. Most noted for blocking ads, but has many other useful features.
- PHProxy is a Web HTTP proxy programmed in PHP to bypass firewalls and other proxy restrictions through a Web interface very similar to the popular CGIProxy.
External links
- Proxy Servers Tutorial - an introduction
- Template:Dmoz
- Template:Dmoz
This article is based on material taken from the Free On-line Dictionary of Computing prior to 1 November 2008 and incorporated under the "relicensing" terms of the GFDL, version 1.3 or later.