The --trace-threads option can now be used to perform tracing on a
thread separate from the main thread when using VCD tracing (with
--trace-threads 1). For FST tracing, --trace-threads can be 1 or 2, and
--trace-fst --trace-threads 1 is the same as what --trace-fst-thread
used to be (which is now deprecated).
Performance numbers on SweRV EH1 CoreMark, clang 6.0.0, Intel i7-3770 @
3.40GHz, IO to ramdisk, with numactl set to schedule threads on different
physical cores. Relative speedup:
  --trace     -> --trace --trace-threads 1       +22%
  --trace-fst -> --trace-fst --trace-threads 1   +38%  (equivalent to the old --trace-fst-thread)
  --trace-fst -> --trace-fst --trace-threads 2   +93%
Run time relative to --trace with no threaded tracing (lower is better):
  --trace                          1.00 x
  --trace --trace-threads 1        0.82 x
  --trace-fst                      1.79 x
  --trace-fst --trace-threads 1    1.23 x
  --trace-fst --trace-threads 2    0.87 x
This means FST tracing with 2 extra threads is now faster than single
threaded VCD tracing, and is on par with threaded VCD tracing. You do
pay for it in total compute though, as --trace-fst --trace-threads 2 uses
about 240% CPU, vs 150% for --trace-fst --trace-threads 1 and 155% for
--trace --trace-threads 1. Still, for interactive use it should be
helpful with large designs.
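These flags only change how verilator builds the tracing code inside the
model; the harness-side trace API should be unchanged. As a minimal sketch
(assuming a top module named "top", hence a generated Vtop class with a
single clock input clk; error handling omitted), an FST-tracing harness
looks roughly like this:

    #include "verilated.h"
    #include "verilated_fst_c.h"
    #include "Vtop.h"  // Generated from the assumed top module "top"

    int main(int argc, char** argv) {
        Verilated::commandArgs(argc, argv);
        Verilated::traceEverOn(true);       // Enable tracing in the runtime
        Vtop* top = new Vtop;
        VerilatedFstC* tfp = new VerilatedFstC;
        top->trace(tfp, 99);                // Trace 99 levels of hierarchy
        tfp->open("dump.fst");
        for (vluint64_t t = 0; t < 1000 && !Verilated::gotFinish(); ++t) {
            top->clk = !top->clk;           // Assumed clock input
            top->eval();
            tfp->dump(t);                   // Record signal values at time t
        }
        tfp->close();
        delete tfp;
        delete top;
        return 0;
    }

For VCD tracing the harness would use verilated_vcd_c.h and VerilatedVcdC
instead; the threading options are given on the verilator command line
when the model is generated.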
Includes support for `timescale, $printtimescale, and $timeformat.
VL_TIME_MULTIPLIER, VL_TIME_PRECISION, and VL_TIME_UNIT have been removed,
and the time precision must now match the SystemC time precision.
To get behavior closer to that of older versions, use e.g.
--timescale-override "1ps/1ps".
* Improve tracing performance.
Various tactics were used to improve the performance of both VCD and FST
tracing:
- Both: Change the tracing functions into templates that take the signal
width as a template parameter. For VCD, these are then specialized for the
widths used by Verilator. This avoids redundant instructions and
hard-to-predict branches (see the sketch after this list).
- Both: Check for value changes via direct pointer access into the
previous signal value buffer. This eliminates a lot of simple pointer
arithmetic instructions from the tracing code.
- Both: Verilator provides clean input, so there is no need to mask off
unused bits.
- VCD: Pre-compute identifier codes and use a memory copy instead of
re-computing them every time a code is emitted. This saves a lot of
instructions and hard-to-predict branches. The added D-cache misses
are cheaper than the removed branches/instructions.
- VCD: Re-write the routines emitting the changes to be more efficient.
- FST: Use the previous signal value buffer the same way as the VCD
tracing code, and only call the FST API when a change is detected.
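As a rough illustration of the first, second, and fourth tactics above
(a simplified sketch with invented names, not Verilator's actual tracing
code):

    // Simplified sketch only: class and member names are invented and do
    // not match Verilator's internals.
    #include <cstdint>
    #include <cstring>

    class SketchVcdTracer {
        uint32_t* m_prevValues;  // Previous value of each signal, 32-bit words
        char* m_idCodes;         // Pre-rendered VCD identifier code per trace code
        char* m_writep;          // Current write position in the output buffer

    public:
        // The width is a template parameter, so loop bounds are compile-time
        // constants; common widths can be specialized, removing per-signal
        // branches on the width.
        template <int T_Bits>
        void chgBus(uint32_t code, uint32_t newval) {
            uint32_t* prevp = m_prevValues + code;  // Direct pointer into buffer
            if (*prevp != newval) {                 // Emit only on an actual change
                *prevp = newval;
                emitBus<T_Bits>(code, newval);
            }
        }

        template <int T_Bits>
        void emitBus(uint32_t code, uint32_t newval) {
            *m_writep++ = 'b';
            // Input is already clean, so no masking of unused bits is needed.
            for (int bit = T_Bits - 1; bit >= 0; --bit) {
                *m_writep++ = '0' + ((newval >> bit) & 1);
            }
            *m_writep++ = ' ';
            // Copy the pre-computed identifier code (assumed here to be at
            // most 7 characters, stored NUL-padded in an 8-byte slot) instead
            // of re-encoding it on every change.
            std::memcpy(m_writep, m_idCodes + code * 8, 8);
            m_writep += std::strlen(m_idCodes + code * 8);
            *m_writep++ = '\n';
        }
    };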
Performance as measured on SweRV EH1, with the pre-canned CoreMark
benchmark running from DCCM/ICCM, clang 6.0.0, Intel i7-3770 @ 3.40GHz,
and IO to ramdisk:
            +--------------+---------------+----------------------+
            |     VCD      |      FST      | FST separate thread  |
            |  (--trace)   | (--trace-fst) | (--trace-fst-thread) |
------------+--------------+---------------+----------------------+
Before      |    30.2 s    |    121.1 s    |        69.8 s        |
------------+--------------+---------------+----------------------+
After       |    24.7 s    |     45.7 s    |        32.4 s        |
------------+--------------+---------------+----------------------+
Speedup     |     22 %     |     256 %     |        215 %         |
------------+--------------+---------------+----------------------+
Rel. to VCD |     1 x      |    1.85 x     |        1.31 x        |
------------+--------------+---------------+----------------------+
In addition, the FST trace size for the above was reduced by 48%.
This looks like a bits/bytes bug. The affected m_codeInc member
determines how many 32-bit words to allocate in the buffer used to store
previous signal values, but it was off by a factor of 8, so we used to
allocate far too much memory.
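In rough terms (the arithmetic below is only illustrative; the helper
names are not Verilator's), the bug amounts to sizing the 32-bit word
buffer from the wrong unit, which differs by exactly a factor of 8:

    #include <cstdint>

    // Words of 32 bits needed to store the previous value of one signal.
    uint32_t wordsNeeded(uint32_t widthBits) {
        return (widthBits + 31) / 32;  // Correct: 32 bits per word
    }

    uint32_t wordsNeededBuggy(uint32_t widthBits) {
        // Wrong: treats the bit count as a byte count (4 bytes per word),
        // reserving 8x more words than necessary.
        return (widthBits + 3) / 4;
    }

For a 64-bit signal this is 2 words versus 16 words; the values still fit,
but the previous-value buffer is bloated and cache behavior suffers.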
SweRV VCD tracing speed +6.5% (excluding IO, clang 6.0), due mainly to
reduced D-cache misses.
When using the __ALL*.cpp based single-compile mode (i.e.: without
VM_PARALLEL_BUILDS), the fast-path tracing code used to be included in
__ALLsup.cpp, which was compiled with OPT_SLOW, severely harming tracing
performance. We now generate __ALLfast.cpp and __ALLslow.cpp instead of
__ALLcls.cpp and __ALLsup.cpp, so the fast support code can be compiled
with OPT_FAST as well.
This patch eliminates a major piece of inefficiency in the FST tracing
support by using an array to look up the fstHandle values corresponding
to trace codes, instead of a tree-based std::map. With this change, FST
tracing is now only about 3x slower than VCD tracing. This requires more
memory for the symbol lookup table, but that table is still small, and
well worth it for the speed benefit.
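A rough before/after sketch of that lookup (the structure and member
names are invented for illustration; fstHandle itself comes from the FST
API): trace codes are small, densely numbered integers, so a vector
indexed directly by code turns each lookup into a single load instead of
a tree walk:

    #include <cstdint>
    #include <map>
    #include <vector>

    typedef uint32_t fstHandle;  // Stand-in for the handle type of the FST API

    // Before: every value change paid for an O(log n) tree-based map lookup.
    struct FstLookupBefore {
        std::map<uint32_t, fstHandle> m_code2symbol;
        fstHandle handle(uint32_t code) const { return m_code2symbol.at(code); }
    };

    // After: a flat table sized to the number of trace codes, indexed directly.
    struct FstLookupAfter {
        std::vector<fstHandle> m_symbolp;
        fstHandle handle(uint32_t code) const { return m_symbolp[code]; }
    };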