cpython

Commit Graph

Author	SHA1	Message	Date
Marc-André Lemburg	e5034378cc	Removing UTF-16 aware Unicode comparison code. This kind of compare function (together with other locale aware ones) should into a new collation support module. See python-dev for a discussion of this removal. Note: This patch should also be applied to the 1.6 branch.	2000-08-08 08:04:29 +00:00
Moshe Zadka	cf703f04ad	Removing warnings found by gcc -Wall	2000-08-04 15:36:13 +00:00
Tim Peters	72d421b75c	Boost buffer sizes in the absence of snprintf on Windows. Ensure that # of args to sprintf always matches # of format specifiers.	2000-08-04 03:05:40 +00:00
Fred Drake	c76e0e5679	snprintf() is not portable, so continue to use sprintf() until a portable snprintf() is available.	2000-08-04 02:34:41 +00:00
Marc-André Lemburg	bff879cabb	This patch finalizes the move from UTF-8 to a default encoding in the Python Unicode implementation. The internal buffer used for implementing the buffer protocol is renamed to defenc to make this change visible. It now holds the default encoded version of the Unicode object and is calculated on demand (NULL otherwise). Since the default encoding defaults to ASCII, this will mean that Unicode objects which hold non-ASCII characters will no longer work on C APIs using the "s" or "t" parser markers. C APIs must now explicitly provide Unicode support via the "u", "U" or "es"/"es#" parser markers in order to work with non-ASCII Unicode strings. (Note: this patch will also have to be applied to the 1.6 branch of the CVS tree.)	2000-08-03 18:46:08 +00:00
Fred Drake	2b83b4601f	Remove the tp_print handler. Revise the tp_repr handler to produce a more "minimal" presentation. Make the tolist() method use PyArg_ParseTuple() and provide a docstring.	2000-08-03 17:43:02 +00:00
Guido van Rossum	c4a19e7fe9	Remobe beopen/cnri/cwi copyrights, according to CNRI instructions. This doesn't change the copyright status for these files -- just the markings! Doing it on the main branch for these three files for which the HEAD revision was pushed back into 1.6.	2000-08-03 16:42:14 +00:00
Guido van Rossum	16b1ad9c7d	Changing the CNRI copyright notice according to CNRI's instructions. This is a notice without a date, which apparently is not a claim to copyright but only advice to the reader. IANAL. :-)	2000-08-03 16:24:25 +00:00
Peter Schneider-Kamp	7e01890986	merge Include/my.h into Include/pyport.h marked my.h as obsolete	2000-07-31 15:28:04 +00:00
Thomas Wouters	334fb8985b	Use 'void' directly instead of the ANY #define, now that all code is ANSI C. Leave the actual #define in for API compatibility.	2000-07-25 12:56:38 +00:00
Thomas Wouters	c307352027	ANSIfy functions that were hiding inside a macro.	2000-07-23 22:09:59 +00:00
Thomas Wouters	a534594fc7	ANSIfication: remove very-old-varargs code, fix function declarations so they include prototypes.	2000-07-22 23:59:33 +00:00
Thomas Wouters	7889010731	Miscelaneous ANSIfications. I'm assuming here 'main' should take (int, char**) and return an int even on PC platforms. If not, please fix PC/utils/makesrc.c ;-P	2000-07-22 19:25:51 +00:00
Marc-André Lemburg	9542f48fd5	Fixed problems with UTF error reporting macros and some formatting bugs.	2000-07-17 18:23:13 +00:00
Marc-André Lemburg	cf5f358784	Restore PyXXX_Length() APIs for binary compatibility. New code will see the macros and therefore use the PyXXX_Size() APIs instead. By Thomas Wouters.	2000-07-17 09:22:55 +00:00
Greg Stein	af36a3aa20	gcc is being stupid with if/else constructs clean out some other warnings	2000-07-17 09:04:43 +00:00
Greg Stein	ff975003cf	stop messing around with goto and just write the macro correctly.	2000-07-16 21:39:49 +00:00
Fredrik Lundh	0e19e76aba	- change \x to mean "byte" also in unicode literals (patch #100912)	2000-07-16 18:47:43 +00:00
Tim Peters	855ffac224	Fix fatal compiler (MSVC6) error: unicodeobject.c(735) : error C2143: syntax error : missing ';' before '}'	2000-07-16 17:10:50 +00:00
Marc-André Lemburg	fb625847bf	Fix to a bug found by Florian Weimer: The UTF-8 decoder is still buggy (i.e. it doesn't pass Markus Kuhn's stress test), mainly due to the following construct: #define UTF8_ERROR(details) do { \ if (utf8_decoding_error(&s, &p, errors, details)) \ goto onError; \ continue; \ } while (0) (The "continue" statement is supposed to exit from the outer loop, but of course, it doesn't. Indeed, this is a marvelous example of the dangers of the C programming language and especially of the C preprocessor.)	2000-07-16 13:29:13 +00:00
Thomas Wouters	7e47402264	Spelling fixes supplied by Rob W. W. Hooft. All these are fixes in either comments, docstrings or error messages. I fixed two minor things in test_winreg.py ("didn't" -> "Didn't" and "Didnt" -> "Didn't"). There is a minor style issue involved: Guido seems to have preferred English grammar (behaviour, honour) in a couple places. This patch changes that to American, which is the more prominent style in the source. I prefer English myself, so if English is preferred, I'd be happy to supply a patch myself ;)	2000-07-16 12:04:32 +00:00
Vladimir Marangozov	467a67e74d	Fix in PyList_New(). With GC enabled and when out of memory, free() the GC pointer, not the object pointer.	2000-07-15 03:31:31 +00:00
Andrew M. Kuchling	06051edc0d	Added PyObject_AsFileDescriptor, which checks for integer, long integer, or .fileno() method	2000-07-13 23:56:54 +00:00
Vladimir Marangozov	8dc19f672b	Propagate the current exception in get_inprogress_dict() -- it doesn't need to be cleared.	2000-07-12 23:39:38 +00:00
Jeremy Hylton	03657cfdb0	replace PyXXX_Length calls with PyXXX_Size calls	2000-07-12 13:05:33 +00:00
Jeremy Hylton	6253f83b0a	change abstract size functions PySequence_Size &c. add macros for backwards compatibility with C source	2000-07-12 12:56:19 +00:00
Andrew M. Kuchling	bd9848d02f	Fix typo in error message	2000-07-12 02:58:28 +00:00
Jack Jansen	28fc880e9a	Include macglue.h on the macintosh, so function prototypes are in scope.	2000-07-11 21:47:20 +00:00
Jeremy Hylton	88887aa38e	small updates to string_join: use PyString_AS_STRING macro on local string object when resizing string, make sure resized string will always be big enough split string containing error message across two lines add test to string_tests that causes resizing	2000-07-11 20:55:38 +00:00
Marc-André Lemburg	566d8a64eb	Jeremy Hylton: better error message for unicode coercion failure	2000-07-11 09:47:04 +00:00
Barry Warsaw	771d0675b6	string_join(): Some cleaning up of reference counting. In the seqlen==1 clause, before returning item, we need to DECREF seq. In the res=PyString... failure clause, we need to goto finally to also decref seq (and the DECREF of res in finally is changed to a XDECREF). Also, we need to DECREF seq just before the PyUnicode_Join() return.	2000-07-11 04:58:12 +00:00
Jeremy Hylton	4904829dbf	fix two refcount bugs in new string_join implementation: 1. PySequence_Fast_GET_ITEM is a macro and borrows a reference 2. The seq returned from PySequence_Fast must be decref'd	2000-07-11 03:28:17 +00:00
Jeremy Hylton	194e43e953	two changes to string_join: implementation -- use PySequence_Fast interface to iterate over elements interface -- if instance object reports wrong length, ignore it; previous version raised an IndexError if reported length was too high	2000-07-10 21:30:28 +00:00
Fredrik Lundh	dde6164402	- changed hash calculation for unicode strings. the new value is calculated from the character values, in a way that makes sure an 8-bit ASCII string and a unicode string with the same contents get the same hash value. (as a side effect, this also works for ISO Latin 1 strings). for more details, see the python-dev discussion.	2000-07-10 18:27:47 +00:00
Fred Drake	100814dc44	ANSI-fication of the sources.	2000-07-09 15:48:49 +00:00
Fred Drake	a2f5511941	ANSI-fication of the sources.	2000-07-09 15:16:51 +00:00
Tim Peters	c2e7da9859	Somebody started playing with const, so of course the outcome was cascades of warnings about mismatching const decls. Overall, I think const creates lots of headaches and solves almost nothing. Added enough consts to shut up the warnings, but this did require casting away const in one spot too (another usual outcome of starting down this path): the function mymemreplace can't return const char, but sometimes wants to return its first argument as-is, which latter must be declared const char in order to avoid const warnings at mymemreplace's call sites. So, in the case the function wants to return the first arg, that arg's declared constness must be subverted.	2000-07-09 08:02:21 +00:00
Fred Drake	ba09633e1e	ANSI-fication of the sources.	2000-07-09 07:04:36 +00:00
Fred Drake	45cfbcccc2	ANSI-fication of the sources.	2000-07-09 06:21:27 +00:00
Fred Drake	ee238b977f	ANSI-fication of the sources.	2000-07-09 06:03:25 +00:00
Fred Drake	1b190b4636	ANSI-fication of the sources.	2000-07-09 05:40:56 +00:00
Fred Drake	1f0968c5f8	Remove legacy use of __SC__; no longer needed now that ANSI source is the standard for Python implementation.	2000-07-09 05:31:24 +00:00
Fred Drake	fd99de6470	ANSI-fication of the sources.	2000-07-09 05:02:18 +00:00
Fred Drake	4288c80599	ANSI-fication of the sources.	2000-07-09 04:36:04 +00:00
Fred Drake	4201b9e420	type_error(): Added "const" to signature to eliminate warning with -Wall.	2000-07-09 04:34:13 +00:00
Fred Drake	3be9a8a5ed	ANSI-fication of the source. Make the indentation and brace placement internally consistent.	2000-07-09 04:14:42 +00:00
Fred Drake	799124718d	ANSI-fication of the sources.	2000-07-09 04:06:11 +00:00
Tim Peters	dbd9ba6a6c	Nuke all remaining occurrences of Py_PROTO and Py_FPROTO.	2000-07-09 03:09:57 +00:00
Fredrik Lundh	2a1e060619	- changed __repr__ to use "unicode escape" encoding for unicode strings, instead of the default encoding. (see "minidom" thread for discussion, and also patch #100706)	2000-07-08 17:43:32 +00:00
Skip Montanaro	4cbc9f7650	delete unused local variable from _PyTrash_deposit_object	2000-07-08 12:06:36 +00:00
Skip Montanaro	4ca150bdb2	_Py_RefTotal should only be declared here when Py_TRACE_REFS are #define'd	2000-07-08 12:04:57 +00:00
Tim Peters	7d3a511a40	Cray J90 fixes for long ints. This was a convenient excuse to create the pyport.h file recently discussed! Please use new Py_ARITHMETIC_RIGHT_SHIFT when right-shifting a signed int and you need sign-extension. This is #define'd in pyport.h, keying off new config symbol SIGNED_RIGHT_SHIFT_ZERO_FILLS. If you're running on a platform that needs that symbol #define'd, the std tests never would have worked for you (in particular, at least test_long would have failed). The autoconfig stuff got added to Python after my Unix days, so I don't know how that works. Would someone please look into doing & testing an auto-config of the SIGNED_RIGHT_SHIFT_ZERO_FILLS symbol? It needs to be defined if & only if, e.g., (-1) >> 3 is not -1.	2000-07-08 04:17:21 +00:00
Tim Peters	43f04a36cf	The tail end of x_sub implicitly assumed that an unsigned short contains 16 bits. Not true on Cray J90.	2000-07-08 02:26:47 +00:00
Tim Peters	9ace6bc7ef	Got RID of redundant coercions in longobject.c (as spotted by Greg Stein -- thanks!). Incidentally removed all the Py_PROTO macros from object.h, as they prevented my editor from magically finding the definitions of the "coercion", "cmpfunc" and "reprfunc" typedefs that were being redundantly applied in longobject.c.	2000-07-08 00:32:04 +00:00
Marc-André Lemburg	e12896ec98	New surrogate support in the UTF-8 codec. By Bill Tutt.	2000-07-07 17:51:08 +00:00
Tim Peters	9f688bf9d2	Some cleanup of longs in prepartion for Cray J90 fixes: got rid of Py_PROTO, switched to ANSI function decls, and did some minor fiddling.	2000-07-07 15:53:28 +00:00
Marc-André Lemburg	5a5c81a0e9	Added new API PyUnicode_FromEncodedObject() which supports decoding objects including instance objects. The old API PyUnicode_FromObject() is still available as shortcut.	2000-07-07 13:46:42 +00:00
Marc-André Lemburg	063e0cb4c6	Fix to bug #393 (UTF16 codec didn't like empty strings) and corrected some usage of 'unsigned long' where Py_UNICODE should have been used.	2000-07-07 11:27:45 +00:00
Sjoerd Mullender	2629bd5a33	Two more places where long should be used instead of int. Especially true after revision 2.36 was checked in...	2000-07-07 09:47:24 +00:00
Marc-André Lemburg	449c325303	Fixed some code that used 'short' to use 'long' instead.	2000-07-06 20:13:23 +00:00
Marc-André Lemburg	85cc4d8940	Fixed a couple of places where 'int' was used where 'long' should have been used.	2000-07-06 19:43:31 +00:00
Jack Jansen	56cdce3070	Conditionally (currently on ifdef macintosh) break the large switch up into 1000-case smaller ones.	2000-07-06 13:57:38 +00:00
Marc-André Lemburg	63f3d17418	Added new codec APIs and a new interface method .encode() which works just like the Unicode one. The C APIs match the ones in the Unicode implementation, but were extended to be able to reuse the existing Unicode codecs for string purposes too. Conversions from string to Unicode and back are done using the default encoding.	2000-07-06 11:29:01 +00:00
Marc-André Lemburg	1f46860a29	Fix to bug #389 : Full_Name: Bastian Kleineidam Version: 2.0b1 CVS 5.7.2000 OS: Debian Linux 2.2 Submission from: earth.cs.uni-sb.de (134.96.252.92)	2000-07-05 15:32:40 +00:00
Marc-André Lemburg	a7acf425f6	Added new .isalpha() and .isalnum() methods which provide interfaces to the new alphabetic lookup APIs in unicodectype.c.	2000-07-05 09:49:44 +00:00
Marc-André Lemburg	f3938f55c7	Added new lookup API which matches all alphabetic Unicode characters, i.e the ones with category 'Ll','Lu','Lt','Lo','Lm'.	2000-07-05 09:48:59 +00:00
Marc-André Lemburg	4027f8f4b3	Added new .isalpha() and .isalnum() methods to match the same ones on the Unicode objects. Note that the string versions use the (locale aware) C lib APIs isalpha() and isalnum().	2000-07-05 09:47:46 +00:00
Tim Peters	1f5871e834	Removed Py_PROTO and switched to ANSI C declarations in the dict implementation. This was really to test whether my new CVS+SSH setup is more usable than the old one -- and turns out it is (for whatever reason, it was impossible to do a commit before that involved more than one directory).	2000-07-04 17:44:48 +00:00
Marc-André Lemburg	1e7205a62a	Bill Tutt: Make unicode_compare a true UTF-16 compare function (includes support for surrogates).	2000-07-04 09:51:07 +00:00
Marc-André Lemburg	891bc65486	If auto-conversion fails, the Unicode codecs will return NULL. This is now checked and the error passed on to the caller.	2000-07-03 09:57:53 +00:00
Fredrik Lundh	efecc7d05b	changed repr and str to always convert unicode strings to 8-bit strings, using the default encoding.	2000-07-01 14:31:09 +00:00
Guido van Rossum	4cc6ac7b87	Neil Schemenauer: small fixes for GC	2000-07-01 01:00:38 +00:00
Guido van Rossum	ffcc3813d8	Change copyright notice - 2nd try.	2000-06-30 23:58:06 +00:00
Guido van Rossum	fd71b9e9d4	Change copyright notice.	2000-06-30 23:50:40 +00:00
Guido van Rossum	9a15c211cf	Fix an error on AIX by using a proper cast.	2000-06-30 22:46:04 +00:00
Fred Drake	a44d353e2b	Trent Mick <trentm@activestate.com>: The common technique for printing out a pointer has been to cast to a long and use the "%lx" printf modifier. This is incorrect on Win64 where casting to a long truncates the pointer. The "%p" formatter should be used instead. The problem as stated by Tim: > Unfortunately, the C committee refused to define what %p conversion "looks > like" -- they explicitly allowed it to be implementation-defined. Older > versions of Microsoft C even stuck a colon in the middle of the address (in > the days of segment+offset addressing)! The result is that the hex value of a pointer will maybe/maybe not have a 0x prepended to it. Notes on the patch: There are two main classes of changes: - in the various repr() functions that print out pointers - debugging printf's in the various thread_*.h files (these are why the patch is large) Closes SourceForge patch #100505.	2000-06-30 15:01:00 +00:00
Marc-André Lemburg	d49e5b4667	Marc-Andre Lemburg <mal@lemburg.com>: A previous patch by Jack Jansen was accidently reverted.	2000-06-30 14:58:20 +00:00
Marc-André Lemburg	f28dd83b86	Marc-Andre Lemburg <mal@lemburg.com>: New buffer overflow checks for formatting strings. By Trent Mick.	2000-06-30 10:29:57 +00:00
Jeremy Hylton	c5007aa5c3	final patches from Neil Schemenauer for garbage collection	2000-06-30 05:02:53 +00:00
Fred Drake	13634cf7a4	This patch addresses two main issues: (1) There exist some non-fatal errors in some of the hash algorithms. For exmaple, in float_hash and complex_hash a certain part of the value is not included in the hash calculation. See Tim's, Guido's, and my discussion of this on python-dev in May under the title "fix float_hash and complex_hash for 64-bit nix" (2) The hash algorithms that use pointers (e.g. func_hash, code_hash) are universally not correct on Win64 (they assume that sizeof(long) == sizeof(void)) As well, this patch significantly cleans up the hash code. It adds the two function _Py_HashDouble and _PyHash_VoidPtr that the various hashing routine are changed to use. These help maintain the hash function invariant: (a==b) => (hash(a)==hash(b))) I have added Lib/test/test_hash.py and Lib/test/output/test_hash to test this for some cases.	2000-06-29 19:17:04 +00:00
Guido van Rossum	4f4b799b33	Jack Jansen: Use include "" instead of <>; and staticforward declarations	2000-06-29 00:06:39 +00:00
Guido van Rossum	d7823f2645	Vladimir Marangozov: Avoid calling the dealloc function, previously triggered with DECREF(inst). This caused a segfault in PyDict_GetItem, called with a NULL dict, whenever inst->in_dict fails under low-memory conditions.	2000-06-28 23:46:07 +00:00
Guido van Rossum	ad89bbcd88	Trent Mick: change a few casts for Win64 compatibility.	2000-06-28 21:57:18 +00:00
Guido van Rossum	eceebb87d9	Jack Jansen: Moved includes to the top, removed think C support	2000-06-28 20:57:07 +00:00
Marc-André Lemburg	0f774e3987	Marc-Andre Lemburg <mal@lemburg.com>: Patch to the standard unicode-escape codec which dynamically loads the Unicode name to ordinal mapping from the module ucnhash. By Bill Tutt.	2000-06-28 16:43:35 +00:00
Marc-André Lemburg	7c014684c2	Marc-Andre Lemburg <mal@lemburg.com>: Better error message for "1 in unicodestring". Submitted by Andrew Kuchling.	2000-06-28 08:11:47 +00:00
Jeremy Hylton	d08b4c4524	part 2 of Neil Schemenauer's GC patches: This patch modifies the type structures of objects that participate in GC. The object's tp_basicsize is increased when GC is enabled. GC information is prefixed to the object to maintain binary compatibility. GC objects also define the tp_flag Py_TPFLAGS_GC.	2000-06-23 19:37:02 +00:00
Jeremy Hylton	d22162bac7	traverse functions should return 0 on success	2000-06-23 17:14:56 +00:00
Jeremy Hylton	99a8f90874	raise TypeError when PyObject_Get/SetAttr called with non-string name	2000-06-23 14:36:32 +00:00
Jeremy Hylton	8caad49c30	Round 1 of Neil Schemenauer's GC patches: This patch adds the type methods traverse and clear necessary for GC implementation.	2000-06-23 14:18:11 +00:00
Fred Drake	396f6e0d6a	Fredrik Lundh <effbot@telia.com>: Simplify find code; this is a performance improvement on at least some platforms.	2000-06-20 15:47:54 +00:00
Marc-André Lemburg	49ef6dc1f4	Marc-Andre Lemburg <mal@lemburg.com>: Fixed a bug in PyUnicode_Count() which would have caused a core dump in case of substring coercion failure. Synchronized .count() with the string method of the same name to return len(s)+1 for s.count('').	2000-06-18 22:25:22 +00:00
Andrew M. Kuchling	74042d6e5d	Patch from /F: this patch introduces PySequence_Fast and PySequence_Fast_GET_ITEM, and modifies the list.extend method to accept any kind of sequence.	2000-06-18 18:43:14 +00:00
Marc-André Lemburg	bea47e768d	Vladimir MARANGOZOV <Vladimir.Marangozov@inrialpes.fr>: This patch fixes an optimisation mystery in _PyUnicodeNew causing segfaults on AIX when the interpreter is compiled with -O.	2000-06-17 20:31:17 +00:00
Marc-André Lemburg	29dc381ce0	Michael Hudson <mwh21@cam.ac.uk>: The error message refers to "append", yet the operation in question is "concat".	2000-06-16 17:05:57 +00:00
Fred Drake	56780257c6	Thomas Wouters <thomas@xs4all.net>: The following patch adds "sq_contains" support to rangeobject, and enables the already-written support for sq_contains in listobject and tupleobject. The rangeobject "contains" code should be a bit more efficient than the current default "in" implementation ;-) It might not get used much, but it's not that much to add. listobject.c and tupleobject.c already had code for sq_contains, and the proper struct member was set, but the PyType structure was not extended to include tp_flags, so the object-specific code was not getting called (Go ahead, test it ;-). I also did this for the immutable_list_type in listobject.c, eventhough it is probably never used. Symmetry and all that.	2000-06-15 14:50:20 +00:00
Marc-André Lemburg	60bc809d9a	Marc-Andre Lemburg <mal@lemburg.com>: Added code so that .isXXX() testing returns 0 for emtpy strings.	2000-06-14 09:18:32 +00:00
Marc-André Lemburg	07ceb67d9c	Marc-Andre Lemburg <mal@lemburg.com>: Fixed a typo and removed a debug printf(). Thanks to Finn Bock for finding these.	2000-06-10 09:32:51 +00:00
Jeremy Hylton	a251ea0680	the PyDict_SetItem does not borrow a reference, so we need to decref reported by Mark Hammon	2000-06-09 16:20:39 +00:00
Andrew M. Kuchling	cb95a1470a	Patch from Michael Hudson: improve unclear error message	2000-06-09 14:04:53 +00:00

1 2 3 4 5 ...

757 Commits