Thanks! That updated config file eliminates the segfault, but then the
results are similar to the example machine configuration (the
"heterogeneous" machine). Here's my simulation output:
PTLCALL type PTLCALL_ENQUEUE
MARSSx86::Command received : -run
Completed 0 cycles, 0 commits: 0 Hz,
0 insns/sec: rip ffffffff815e0b52 ffffffff81037f7b ffffffff81037f7b
ffffffff81037
Completed 85000 cycles, 0 commits: 424705 Hz,
0 insns/sec: rip ffffffff815e0b52 ffffffff81037f7b ffffffff81037f7b
ffffffff81037
Completed 171000 cycles, 0 commits: 426957 Hz,
0 insns/sec: rip ffffffff815e0b52 ffffffff81037f7b ffffffff81037f7b
ffffffff81037
Completed 257000 cycles, 0 commits: 427504 Hz,
0 insns/sec: rip ffffffff815e0b52 ffffffff81037f7b ffffffff81037f7b
ffffffff81037
Completed 342000 cycles, 0 commits: 424877 Hz,
0 insns/sec: rip ffffffff815e0b52 ffffffff81037f7b ffffffff81037f7b
ffffffff81037
Completed 428000 cycles, 0 commits: 427332 Hz,
0 insns/sec: rip ffffffff815e0b52 ffffffff81037f7b ffffffff81037f7b
ffffffff81037
Completed 513000 cycles, 0 commits: 424293 Hz,
0 insns/sec: rip ffffffff815e0b52 ffffffff81037f7b ffffffff81037f7b
ffffffff81037
Completed 599000 cycles, 0 commits: 427482 Hz,
0 insns/sec: rip ffffffff815e0b52 ffffffff81037f7b ffffffff81037f7b
ffffffff81037
Completed 685000 cycles, 0 commits: 427210 Hz,
0 insns/sec: rip ffffffff815e0b52 ffffffff81037f7b ffffffff81037f7b
ffffffff81037
Completed 771000 cycles, 0 commits: 426081 Hz,
0 insns/sec: rip ffffffff815e0b52 ffffffff81037f7b ffffffff81037f7b
ffffffff81037
Completed 857000 cycles, 0 commits: 427577 Hz,
0 insns/sec: rip ffffffff815e0b52 ffffffff81037f7b ffffffff81037f7b
ffffffff81037
Completed 942000 cycles, 0 commits: 423602 Hz,
0 insns/sec: rip ffffffff815e0b52 ffffffff81037f7b ffffffff81037f7b
ffffffff81037
Completed 1028000 cycles, 0 commits: 427690 Hz,
0 insns/sec: rip ffffffff815e0b52 ffffffff81037f7b ffffffff81037f7b
ffffffff81037
f7b 00000000004149bf ffffffff81037f7b ffffffff81037f7b
ffffffff81037f7b[vcpu 0]
thread 0: WARNING: At cycle 1048577, 0 user commits: no instructions have
committed for 1048577 cycles; the pipeline could be deadlocked
qemu-system-x86_64: ptlsim/build/core/ooo-core/ooo.cpp:876: bool
ooo::OooCore::runcycle(void*): Assertion `0' failed.
Aborted (core dumped)
Any idea why these two configurations fail?
On Tue, Jul 3, 2012 at 11:57 AM, Furat Afram <[email protected]> wrote:
> Hi
> Try the attached xeon configuration.
> -Fuart
>
>
> On Tue, Jul 3, 2012 at 2:33 PM, Peter Hornyack <[email protected]>wrote:
>
>> Yes, in the steps that I included I just removed the default config files
>> to avoid repeat definition errors from scons.
>>
>> I've attached a tar.gz file containing the config files and simulation
>> log files for two machines: "heterogeneous", which comes from the Marss
>> example machine configuration webpage, and "xeon", which I created myself
>> to try to simulate an 8-core machine (so it must be built with scons c=8).
>> If the attachment is too large, I can post these on the web as well. This
>> is the output that I see when I run a simulation with "-machine xeon":
>>
>> MARSSx86::Command received : -run
>> Completed 0 cycles, 0 commits: 0 Hz,
>> 0 insns/sec: rip ffffffff81037f7b ffffffff81037f7b ffffffff81037f7b
>> ffffffff81037f7b 00000000004149bf ffffffff81037f7b ffffffff81037f7b
>> ffffffff81037f7bSegmentation fault (core dumped)
>>
>> Here's the backtrace from the core file (after building with scons c=8
>> debug=1):
>>
>> Core was generated by `qemu/qemu-system-x86_64 -hda ...'.
>> Program terminated with signal 11, Segmentation fault.
>> #0 0x00000000005af1ef in
>> Memory::CoherentCache::MESILogic::complete_request (
>> this=0x392d530, queueEntry=0x39086f8, message=...)
>> at ptlsim/build/cache/mesiLogic.cpp:305
>> 305 message.arg);
>> (gdb) thread apply all bt
>>
>> Thread 2 (Thread 0x7f7417db3700 (LWP 5801)):
>> #0 0x00007f7473b410fe in pthread_cond_timedwait@@GLIBC_2.3.2 ()
>> from /lib/x86_64-linux-gnu/libpthread.so.0
>> #1 0x00000000004ba51a in cond_timedwait (ts=0x7f7417db2e60,
>> mutex=0xe3bd00,
>> cond=0xe3bd60) at qemu/posix-aio-compat.c:104
>> #2 aio_thread (unused=<optimized out>) at qemu/posix-aio-compat.c:325
>> #3 0x00007f7473b3ce9a in start_thread ()
>> from /lib/x86_64-linux-gnu/libpthread.so.0
>> #4 0x00007f74729f24bd in clone () from /lib/x86_64-linux-gnu/libc.so.6
>> #5 0x0000000000000000 in ?? ()
>>
>> Thread 1 (Thread 0x7f7475c39740 (LWP 5800)):
>> #0 0x00000000005af1ef in
>> Memory::CoherentCache::MESILogic::complete_request (
>> this=0x392d530, queueEntry=0x39086f8, message=...)
>> at ptlsim/build/cache/mesiLogic.cpp:305
>> #1 0x0000000000595c0a in
>> Memory::CoherentCache::CacheController::complete_request (this=0x3908490,
>> message=..., queueEntry=0x39086f8)
>> at ptlsim/build/cache/coherentCache.cpp:386
>> #2 0x00000000005975c8 in
>> Memory::CoherentCache::CacheController::handle_lower_interconnect
>> (this=0x3908490, message=...)
>> at ptlsim/build/cache/coherentCache.cpp:230
>> #3 0x0000000000597817 in
>> Memory::CoherentCache::CacheController::handle_interconnect_cb
>> (this=0x3908490, arg=0x2fb7280)
>> at ptlsim/build/cache/coherentCache.cpp:412
>> #4 0x000000000058d3f2 in
>> superstl::TFunctor1<Memory::Controller>::operator() (
>> this=<optimized out>, arg=<optimized out>) at
>> ptlsim/lib/superstl.h:3950
>> #5 0x00000000005ffd65 in superstl::Signal::emit (this=<optimized out>,
>> arg=<optimized out>) at ptlsim/build/lib/superstl.cpp:1431
>> #6 0x00000000005b668e in
>> Memory::SplitPhaseBus::BusInterconnect::data_broadcast_completed_cb
>> (this=0x396d3e0, arg=0x396d648)
>> at ptlsim/build/cache/splitPhaseBus.cpp:507
>> #7 0x00000000005b92e0 in
>> superstl::TFunctor1<Memory::SplitPhaseBus::BusInterconnect>::operator()
>> (this=<optimized out>, arg=<optimized out>)
>> at ptlsim/lib/superstl.h:3950
>> #8 0x00000000005ffd65 in superstl::Signal::emit (this=<optimized out>,
>> arg=<optimized out>) at ptlsim/build/lib/superstl.cpp:1431
>> #9 0x00000000005aa703 in execute (this=0x2fc9028)
>> at ptlsim/cache/memoryHierarchy.h:110
>> #10 Memory::MemoryHierarchy::clock (this=0x2fb5ff0)
>> at ptlsim/build/cache/memoryHierarchy.cpp:106
>> #11 0x000000000067c4d7 in BaseMachine::run (this=0x1268320, config=...)
>> at ptlsim/build/sim/machine.cpp:258
>> #12 0x000000000068b534 in ptl_simulate () at
>> ptlsim/build/sim/ptlsim.cpp:1357
>> #13 0x000000000057f902 in sim_cpu_exec () at qemu/cpu-exec.c:310
>> #14 0x000000000041ffe5 in main_loop () at qemu/vl.c:1450
>> #15 main (argc=7, argv=0x7fffb576a238, envp=<optimized out>) at
>> qemu/vl.c:3189
>>
>> I also tried making some adjustments to that xeon config file (like using
>> p2p connections between L2/L3 and L3/MEM instead of split_bus), but these
>> didn't seem to help. As I said, I'm not sure if I'm doing something wrong
>> or if there's a problem in Marss; if possible, it would be helpful to get a
>> working machine configuration with L3 cache to start from. Please let me
>> know if I can provide any other information.
>>
>> Thanks,
>> Peter
>>
>>
>> On Tue, Jul 3, 2012 at 8:37 AM, Furat Afram <[email protected]>wrote:
>>
>>> Hi
>>> You don't need to remove any files, MARSS complies with all
>>> the available machines then you can choose any in run time using "
>>> -machine".
>>> Can you attach the log files in both cases (you own configuration and
>>> example configuration) and your the machine configuration you created
>>> Thanks
>>> -Furat
>>> On Mon, Jul 2, 2012 at 11:11 PM, Peter Hornyack
>>> <[email protected]>wrote:
>>>
>>>> Hello,
>>>>
>>>> I've been using Marss for a few days and would like to simulate a more
>>>> sophisticated machine than those that are included in the original
>>>> default.conf configuration. I found the "Machine Configuration" page
>>>> on the Marss website that describes how to edit the machine
>>>> configuration files:
>>>> http://marss86.org/index.php?title=Machine_Configuration. However,
>>>> when using the example configuration from the bottom of that web page,
>>>> or any of my own created machine configurations that try to use an L3
>>>> cache, I am getting errors from Marss. These are the steps to
>>>> reproduce the problem (on my Core 2 Duo machine running Ubuntu 12.04):
>>>>
>>>> > git clone git://github.com/avadhpatel/marss.git
>>>> > cd marss
>>>> > rm -f config/atom_core.conf config/default.conf config/moesi.conf
>>>> config/ooo_core.conf
>>>> > edit config/example.conf:
>>>> Copy example configuration from bottom of this page:
>>>> http://marss86.org/index.php?title=Machine_Configuration
>>>> > scons c=2
>>>> > edit test.cfg:
>>>> -machine heterogeneous
>>>> -bench-name test
>>>> -stats test.stats
>>>> -logfile test.log
>>>> -loglevel 10
>>>> > qemu/qemu-system-x86_64 -hda /path/to/ubuntu-kvm-natty-amd64.raw -m
>>>> 1024 -simconfig test.cfg
>>>>
>>>> My disk image contains a 64-bit Ubuntu 11.04 distribution, and I see
>>>> the output "Simulator is now waiting for a 'run' command" in my
>>>> terminal. In the emulated system I now run a program that switches to
>>>> simulation mode, and I see the following output:
>>>>
>>>> PTLCALL type PTLCALL_ENQUEUE
>>>> MARSSx86::Command received : -run
>>>> Completed 0 cycles, 0 commits: 0 Hz,
>>>> 0
>>>> Completed 461000 cycles, 0 commits: 2302454 Hz,
>>>> 0
>>>> Completed 927000 cycles, 0 commits: 2329693 Hz,
>>>> 0
>>>> insns/sec: rip ffffffff8109c080 ffffffff8100c980[vcpu 0] thread 0:
>>>> WARNING: At cycle 1048577, 0 user commits: no instructions have
>>>> committed for 1048577 cycles; the pipeline could be deadlocked
>>>> qemu-system-x86_64: ptlsim/build/core/ooo-core/ooo.cpp:876: bool
>>>> ooo::OooCore::runcycle(void*): Assertion `0' failed.
>>>> Aborted (core dumped)
>>>>
>>>> If I perform the same steps but don't remove the default config files
>>>> and use the default "shared_l2" or "private_L2" machines, then the
>>>> simulation runs fine with my test program. I have also created a
>>>> different machine configuration with an L3 cache (attempting to
>>>> simulate an 8-core Intel Xeon processor) that causes a segfault in
>>>> Marss during the simulation.
>>>>
>>>> I'm not sure if this issue is a bug in Marss or a problem due to a bad
>>>> machine configuration. If somebody can take a look and offer any
>>>> advice, that would be great. The steps that I've included should
>>>> hopefully make it easy to reproduce this issue, but I can gladly post
>>>> my example.conf file, my other 8-core config file, test.log output, or
>>>> anything else that would be helpful.
>>>>
>>>> Also, if anybody has a working machine configuration with an L3 cache
>>>> or an 8-core configuration and can post it, that would also be
>>>> excellent (I looked around the mailing list archives a bit for
>>>> something like this, but failed to find anything); if I can at least
>>>> get my hands on a configuration that works, then I can hopefully tweak
>>>> it to something close to the processor that I'm trying to simulate.
>>>>
>>>> Thanks,
>>>> Peter
>>>>
>>>> _______________________________________________
>>>> http://www.marss86.org
>>>> Marss86-Devel mailing list
>>>> [email protected]
>>>> https://www.cs.binghamton.edu/mailman/listinfo/marss86-devel
>>>>
>>>
>>>
>>
>
_______________________________________________
http://www.marss86.org
Marss86-Devel mailing list
[email protected]
https://www.cs.binghamton.edu/mailman/listinfo/marss86-devel